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Abstract. We show that a random endomorphism of a free group gives rise to 
an HNN extension which contains a closed surface subgroup, with probability 
one; a special case is the HNN extension associated to the endomorphism of a 
rank 2 free group (considered by Sapir) sending a to ab and b to ba. 

Wc further show that a group obtained by doubling a free group along any 
collection of subgroups (of arbitrary rank) has the property that every rational 
2-dimensional homology class is virtually represented by surface subgroups, 
and the unit ball in the Gromov norm is a finite sided rational polyhedron. 

These results are obtained by a mixture of combinatorial, geometric and 
linear programming techniques. We obtain other related results of a more 
technical nature concerning stable commutator length in free groups, especially 
concerning the structure of the space of so-called counting quasimorphisms 
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1. Introduction 

1.1. Gromov's surface subgroup question. The following well-known question 
is usually attributed to Gromov: 

Question 1.1 (Gromov). Let G be a one-ended hyperbolic group. Does G contain 
the fundamental group of a closed surface with x < ? 

Hereafter we abbreviate "fundamental group of a closed surface with \ < 0" 
to "surface group" , so that this question asks whether every one-ended hyperbolic 
group contains a surface subgroup. This question is wide open in general, but a 
positive answer is known in certain special cases, including: 
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(1) Coxeter groups (Gordon-Long-Reid [21]): 

(2) Graphs of free groups with cyclic edge groups and 62 > (Calegari [7]); 

(3) Fundamental groups of hyperbolic 3-manifolds (Kahn-Markovic [27]); 

(4) Certain doubles of free groups (Kim- Wilton, Kim-Oum [29l [28] ); 
(this list is not exhaustive). 

One of the goals of this paper is to describe how linear programming may be used 
to settle the question of the existence of surface subgroups in certain graphs of free 
groups, either by giving a powerful computational tool to find surface subgroups in 
specific groups, or by reducing the analysis of this question in infinite families of 
groups to a finite (tractable) calculation. 

We are able to prove the existence of surface subgroups in the following groups: 

(1) A group G with 6 2 > obtained by doubling a free group F along a finite 
collection of finitely generated subgroups Fi (see § I6.3j) : 

(2) A group G obtained as an HNN extension F*^ where F is a free group of 
fixed rank and (f> is a random endomorphism (see § 18. 3j) : 

(3) "Sapir's group" C = F*^ for F = (a, b) and <f> : a -> ab,b ->• ba (see § 18"!]) . 
One sense in which this constitutes a significant advance over the results in [7] [29] [28] 
(for instance) is that the edge groups are free groups of arbitrary rank, whereas in 
the cited papers the edge groups were required to be cyclic. 

Bullet (1) above is implied by a stronger result about the Gromov norm on H2 of 
the double of F along the Fi ; we discuss this stronger result in the next subsection. 
Bullet (3) is reasonably self-explanatory. A precise statement of bullet (2) is: 

Random /-folded Surface Theorem 18.91 Let k > 2 be fixed, and let F be a 
free group of rank k. Let (f> be a random endomorphism of F of length n. Then the 
probability that F*^ contains an essential surface subgroup is at least 1 — 0(C~ n ) 
for some C > 1 and c > 0. 

Here a random endomorphism of length n is one that takes the generators to 
reduced words of length n chosen independently and randomly with the uniform 
distribution. We became interested in surface subgroups of HNN extensions of 
free groups after discussions with Mark Sapir, who conjectured that the subgroup 
C does not contain a surface subgroup, and thought it was unlikely that many 
HNN extensions should contain surface subgroups (other than 1? subgroups for 
cndomorphisms fixing a nontrivial conjugacy class). Therefore it seems safe to say 
that the Random /-folded Surface Theorem is in many ways very unexpected. 

The reader who is interested only in Gromov's surface subgroup question can read 
§[8] with minimal reference to the rest of the paper. §[6] depends more substantially 
on the material in § [3] 

1.2. Stable commutator length and Gromov norm. We are interested not 
only in the existence of surface subgroups, but also in their quality. Fix a group G. 
Every integral 2-dimensional homology class (resp. relative 2-dimensional homology 
class) in G is represented by a nontrivial — but not necessarily injective — map 
from a surface subgroup (resp. surface with boundary subgroup). The complexity 
of such a surface is measured by — y/2, and the infimum of the complexity of the 
surfaces projectively representing a given homology class is 1/4 times the Gromov 
norm of the class (resp. the scl norm of the relative class). A surface realizing the 
infimum may not exist, but if it does, the map is injective. We distinguish such 
special surface subgroups from more general ones by calling them extremal. 
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If G is a graph of free groups, we can find a surface subgroup of G as an amalgam 
of surface subgroups with boundary in the free factors. Thus one way to find an 
extremal subgroup of G is to build it out of subgroups with boundary extremal in 
the factors. To specify a 2-dimensional homology class necessitates the choice of 
a homologically trivial family T of conjugacy classes in G. Then H2{G,T) is an 
affine space over i?2(G) (for instance, if ^(G) is trivial, H2(G,T) consists of a 
single class). If G is a graph of free groups amalgamated over cyclic subgroups, 
there are natural candidates for T, namely (powers of) the cyclic generators of the 
edge groups. But for amalgams over higher rank groups, there are infinitely many 
conjugacy classes to consider, even up to taking powers. 

For F a free group and T a specific homologically trivial family of conjugacy 
classes, the scl norm of the nontrivial class in H2(G,T) is just the (usual) stable 
commutator length of T, and denoted scl(r) (see [8] for an introduction to this 
subject). In [5] an algorithm is given to compute scl(r) in a free group, and the 
algorithm proves that the infimum is realized by an extremal surface. One of the 
main results in this paper is an extension of this algorithm to minimize scl(r) over 
certain infinite families T; roughly speaking, those parameterized by weights on a 
traintrack. 

The main theorem we prove along these lines is the Traintrack Rationality The- 
orem: 

Traintrack Rationality Theorem 13.241 For w 6 W + , there is a formula 

scl(u;) = inf{— x(y)/2 : y £ Y + and Dsiy) = w} 

Moreover, the infimum is achieved, and if w is rational, the infimum is achieved at 
a rational y. Furthermore, the function sc\(w) varies piecewise rational linearly on 
W+. 

Here W + denotes the non-negative homologically trivial weights on a traintrack 
over a rose for F (for precise definitions, see §[3]). 

A double of such an extremal surface with boundary is extremal in the double, 
so we obtain the following theorem: 

Double Norm Theorem 16.51 Let F be a free group, and Fi a finite collec- 
tion of finitely generated subgroups. Let G be the double of F along the Fi; i.e. 
G := D(F, {Fi}). Then the unit ball in the Gromov norm on H2(G) is a rational 
polyhedron, and every rational class in H2 (G) is projectively represented by a finite 
sum of injective closed surface subgroups. 

The same methods allow one to approximate in a natural way the (polyhedral) 
unit ball in the scl norm in the infinite dimensional space (F) for F a free 
group by a natural sequence of (polyhedral) unit balls in finite dimensional normed 
spaces. The definition of these finite dimensional spaces, and the computation of 
the norms, is entirely effective, and we describe an explicit and efficient algorithm 
to do so. 

1.3. Counting functions and combinatorics. Stable commutator length is a 
quantitative version of relative 2-dimensional homology. It is dual to 2-dimensional 
bounded cohomology, as introduced by Gromov [24j . More precisely, for any group 
G there is an exact sequence 

-> H\G) -+Q(G)^> Hl{G) -> H 2 (G) 
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where Q denotes the vector space of homogeneous quasimorphisms, i.e. functions 
4> : G — > M satisfying <fi(g n ) — n(j)(g), and \4>{gh) — <j>(g) — 4>{h)\ < D(<j>) for some 
least constant D((f>) (called the defect), for any g,h G G. The defect defines a norm 
on Q/H 1 , and this normed vector space is the isometric dual of (G) with (twice) 
its scl (pseudo)-norm. 

Even in a free group, our understanding of Q is far from complete. Topologically, 
Q(F)/H 1 (F) is a nonseparable Banach space, and it is probably hopeless to give 
an explicit description of its geometry. However one may aim to give an explicit 
description of a subset of Q large enough to witness the geometry of B^ by operator 
duality. That is to say: given b G B± , we would like to be able to construct some 
explicit <j> G Q with scl(6) = <fi(b)/2D(<fi); such a cf> is said to be extremal for b. 

A natural class of explicit homogeneous quasimorphisms are the so-called count- 
ing quasimorphisms, first introduced by Rhcmtulla [32] , and then further discussed 
by Brooks [5]. For a word a, let C a (b) count the number of copies of a appearing 
in b. A counting quasimorphism is a linear combination of functions of the form 
C a — C a . If we bound the length of the words a appearing in such a linear 
combination by I, we get a space Qp of homomgeneous quasimorphisms. In §|4]we 
construct an explicit family of finite dimensional vector spaces Wg/e with norms 
|| • ||^, and a compatible family of linear projections ^ i : B^ — > Wp for which the 
following duality holds: 

Counting Quasimorphism Duality Theorem 14.241 For any £ and any b G 

B/e, there is an equality 

m(b)\u= ^p 

It is reasonable to conjecture that scl(6) = lim^oo ||\f^(b)||£, and in fact that 
equality is achieved for all sufficiently large I. 

By breaking the asymmetry of the Qf, and considering the broader class of all 
counting functions (i.e. linear combinations of C a as above) we are able to detect 
the full geometry of the scl norm ball. This depends on the notion of posimor- 
phisms, which are related to quasimorphisms roughly as subharmonic functions are 
to harmonic functions; see Definition 14.111 or Theorem 14.191 for the analog of the 
Counting Quasimorphism Duality Theorem for (counting) posimorphisms. 

The unit balls in Qi/H 1 are rational polyhedra. In §[5] we go some way towards 
giving a description of these polyhedra. First, for £ = 2 and arbitrary rank, we 
are able to give a complete description of the codimension one faces. Fix S a 
symmetric free generating set for F, so if F has rank fc, the set S has 2k elements. 
Each cyclically ordered triple of distinct elements of F determines a codimension 1 
face of Q2/H 1 , called a tripod face. The unit ball has the following description: 

Unit Ball Theorem 15. lit The codimension 1 faces of the unit ball in Q2/FI 1 are 
exactly the set of length 1 tripod faces. The symmetric group S^k acts on this set 
by permutation of edge labels, and the resulting action on Q2/ H 1 is by isometries. 
As a representation of S2k> the space Q2/H 1 is isomorphic to A 2 V where V is the 
standard (2k — 1) -dimensional representation. 

The simplicity of the statement of the Unit Ball Theorem obscures how surprising 
it really is; only the subgroup Z/2Z fc x Sk of S2k acts on S by automorphisms. The 
fact that the full group S2k of permutations acts even linearly on Q2/H 1 , let alone 
by isometries, is very surprising and entirely unexpected. 
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The generalization to bigger i is less complete. It turns out that there are 
natural analogs of tripod faces in Qg, and these are a complete set of supporting 
hyperplanes for the unit ball. However for I > 3 it is a redundant collection, and it 
is more difficult to give a finite description of the unit ball analogous to the I — 2 
case. On the other hand, we are able to give a conditional answer to the problem of 
finding an explicit extremal quasimorphism dual to a given chain T. We are able to 
show (Corollary I5.6j) that for any b G , the (possibly empty) set of (ft e Qi/H 1 
extremal for T can be computed explicitly and directly from the combinatorics of 
any extremal surface for T. In words: an extremal surface has as a spine a certain 
kind of fatgraph; this fatgraph determines a collection of tripods, namely the tripods 
which immerse into the fatgraph. The (ft £ Qi/H 1 extremal for L are then precisely 
the cone on the intersection of the associated tripod faces (if nonempty). For more 
details, see § 15.21 

1.4. Geometric endomorphisms, and flat surfaces. In §|H]we investigate the 
class of extremal and flat surfaces in HNN extensions of free groups. An extremal 
surface is one which realizes the Gromov norm of some class in ^(F*^). We 
interpret the (Gromov) norm of this class as the translation length of eft on a certain 
(infinite dimensional) affine normed space modeled on B/e. Moreover, for random 
endomorphisms (ft, lower bounds for this norm can be obtained by considering the 
translation length on finite dimensional normed spaces modeled on the various 
Wt- If equality holds, extremal surfaces exist, and can be certified by counting 
quasimorphisms . 

These affine actions factor through a semidirect product of the form X G for 
finite groups G; we thereby obtain homomorphisms from certain subsemigroups of 
End(_F) to (abelian) groups of translations on finite dimensional vector spaces. This 
can be thought of in a natural way as an analog of (a summand of) the Johnson 
homomorphism on the Torelli subgroup of a mapping class group. In the particular 
special case of flat surfaces the analogy is very tight. 

1.5. The role of linear programming. What is the role of linear programming 
in all this? The essence of our method depends on the fact that injectivity in hyper- 
bolic groups can often be certified by local data. Thus building a surface subgroup 
in a hyperbolic group reduces to many simple "subproblems" . These problems are 
typically interrelated in a complicated (but also local) way, but sometimes different 
tools (combinatorics, ergodic theory) can be used to reduce a large collection of 
"weakly coupled" subproblems to a large collection of independent subproblems, 
parameterized by linear data in the form of linear equalities and inequalities. This 
last system can be solved with linear programming (hereafter abbreviated to LP). 

The power and sophistication of modern practical algorithms for LP problems 
thus frequently lets us reduce a theoretical question concerning infinitely many 
groups to a concrete LP problem, and then to answer it. Note that the finite LP 
problem to which the theoretical question reduces might itself be quite substantial. 
For an example, see the proof of the Random /-folded Surface Theorem in § [5J 
especially the discussion in § 18.3.61 

2. Stable commutator length 

In this section we recall some basic elements from the theory of stable commuta- 
tor length, especially in free groups. For a reference, see [8J, especially Chapter 2. 
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The main novelty in this section is our treatment of homogenization. We are 
careful to define homogenization (of group chains, 1-manifolds or surfaces) to be 
the equivalence relation which equates taking an (oriented) n-fold cover with mul- 
tiplication by n for positive n. 

Throughout this section and the remainder of the paper, we use the notation 
R[*] for the R-vector space with basis a given generating set *, and by R + [*] the 
positive orthant. This is an R-vector space with a canonical basis, so it makes 
sense to talk about integral/rational/positive vectors. We also use the notation 
Z[*], Z + [*] and so on. 

2.1. Chains and antisymmetry. We fix a free group F and a free generating 
set a, 6, c, • • • . We denote inverse by capitalization, hence A := a -1 and so on. 
Let S — {a, A,b, B, ■ ■ ■} denote the symmetrized generating set. For simplicity, we 
work over R in what follows. Most of what we say makes sense for an arbitrary 
group G, but it is convenient to focus on the case of free groups. 
The proof of the following lemma is obvious: 

Lemma 2.1. Conjugacy classes in F are in bijection with reduced cyclic words in 
the generators. 

We let C denote the free R-vector space generated by primitive conjugacy classes 
in F, and let C + denote the positive orthant. There is a homogenization map from 
K + [_F] — > C + sending g n — > ng for n positive, and this map extends by linearity to 
R[F] -+ C. 

There is an abelianization map ab : C — > Hi, and its restriction to C + is 
surjective. 

Definition 2.2. Define e : C — > C to be the involution defined on conjugacy classes 
by e : g -> -g~ x . 

Lemma 2.3. The map ab factors through e, and there is a surjection o/R + modules 
C+ -> C/e. 

Proof. Since a = —A in Hi and so on, the first statement is obvious. The second 
statement is just the observation that every element of C/e has an expression as a 
formal sum with all coefficients positive. □ 

Remark 2.4. We call an element of C + primitive if it does not contain a term of 
the form t(c + c _1 ) with t > 0. Every element of C/e has a unique expression as a 
primitive element of C + . 

Definition 2.5. We let B denote the kernel of ab, so that there is an exact sequence 
B -)■ C Hi. Define B+ := B D C+ . 

Lemma 2.6. B + spans B as a vector space, and B + — > B/e is a surjection o/R + 
modules. 

Proof. Write b € B as b+ - b~ with b+ and b~ in C+. Then b = (6+ - e(b~)) - 
(b~ — e(b~)) and both of the bracketed expressions are in B + . The second statement 
follows from Lemma 12.31 and the fact that ab factors through e. □ 

It may seem clumsy to work with R + modules, but in fact this is crucial for 
what follows. Note that although C + has a natural basis as a free R + module, 
namely the set of all reduced cyclic words in the generators, the space C/e, which 
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is canonically isomorphic to the set of primitive elements in C + , does not have a 
natural basis as an R vector space. Such a basis amounts to a choice of one element 
for each e orbit. We will return to this issue in the sequel. 

2.2. Spaces. We fix A, a K(F, 1). For instance, it is convenient to take A to be a 
wedge of \S\/2 circles, one for each mutually inverse pair of generators. 

Conjugacy classes in F are in bijection with free homotopy classes of maps 
S 1 ->X. 

Definition 2.7. Let C denote the space of R linear formal sums of free homotopy 
classes of maps S 1 — > X modulo the relation that if it : L' — > L is an oriented cover 
of oriented 1-manifolds of (positive) degree n, then the composition L' — >• L — > X 
represents n times L — > X. 

Informally, we refer to C as the space of multicurves. With this definition, there 
is an obvious isomorphism of C with C and C + with the positive orthant C + . 
Similarly, B and B + are identified with the homologically trivial multicurves. There 
is a linear map e : C — > C which reverses orientation and multiplies coefficients by 
— 1. Evidently there is a natural identification of C/e with C/e, and therefore a 
map ab : Q/e — > H\. 

A surface E is admissible if it is compact and oriented, and if each component has 
non-positive Euler characteristic. A map E — > X is admissible if E is admissible. 

Remark 2.8. It is convenient to allow some components of <9E to map homotopi- 
cally trivially to A; we will refer to such components later as "ideal components"; 
however, they do not play a significant role in what follows. 

Definition 2.9. We let S denote the space of R linear formal sums of free homotopy 
classes of admissible maps S — > X modulo the relation that if tt : E' — > S is an 
oriented cover of (positive) degree n, then the composition £' — > E — > X represents 
n times E — > X. 

Similarly, we define S + as the positive orthant, and e : S — > S reverses orientation 
and multiplies coefficients by —I. Again, S + — >• S/e is an isomorphism of R + 
modules. 

Definition 2.10. If E is admissible, and E — > X is a free homotopy class of map, 
the restriction of the map to <9E represents a class in C + . This defines a linear map 
d : S — > G which restricts to d : § + — > C + and descends to d : S/e — > C/e. 

Remark 2.11. Note that the image dS/e consists of precisely the kernel of ab, and 
can therefore be identified with B/e. 

If x denotes Euler characteristic, then because \ IS multiplicative under covers, 
there is a well-defined linear function \ on S + . The function ~x/2 is non- negative 
on S + , and invariant under — e, and therefore defines a pseudo-norm on S/e (thought 
of as an R vector space). With this definition in place, we have the following 
proposition: 

Proposition 2.12. Let b G B/e, and think of b as an element of C/e as above. 
Then 

scl(6) = inf -%(s)/2 

ds=b 

where the infimum is taken over all s G S/e. 
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The reader can take this as a definition of scl. For a proof (in which scl is given 
its usual, algebraic definition) see [8], Prop. 2.74. 

Remark 2.13. The vector space B/e is denoted B^ in [8] and elsewhere. 

3. Traintracks 

In this section we define traintracks, and prove the Traintrack Rationality The- 
orem; i.e. Theorem 13.241 This theorem is proved by encoding the computation 
of scl as a rational linear programming problem; the linear programming prob- 
lem depends rational linearly on the traintrack weight. This proves the Traintrack 
Rationality Theorem. 

3.1. Traintracks. We use the notation t for the involution on S that interchanges 
a generator with its inverse. At the level of homology, i = — e. 

Definition 3.1. A traintrack T is a directed graph with edges E and vertices V, 
together with an evaluation function h : E — > S. There are source and target maps 
s : E —> V and t : E V giving the structure of a directed graph. An ordered 
pair of edges eo,ei is composable if te<j = se\. A traintrack is reduced if for every 
composable pair (eo, ei) we have heo ^ ihe\. 

If F and S need to be stressed, we talk about a traintrack over F, S. 

Remark 3.2. Note that our traintracks are oriented] by contrast, unoriented (and 
even unorientable) traintracks are important in the Thurston theory of surface 
automorphisms . 

Recall that X, our K(F, 1), is a bouquet of circles, one for each pair of generators 
for F. If we think of T as an (oriented) 1-complex, then h is just a simplicial map 
T -4 X. 

Definition 3.3. Given a traintrack with edges E and vertices V, define linear maps 
d : M[E] -> M.[V] on generators by 9(e) = te — se and H : R[E] — > Hi by extending 
h on generators by M linearity. Define the weight space W to be the kernel 

W := kerd © H : R[E] -> R[V] © i?i 

We define the cone of positive weights W + :— W OM. + [E]. 

Lemma 3.4. W + is a convex rational polyhedral cone. 

Proof. The maps d and H are defined over Z, so W is a rational linear subspace of 
M.[E]. The intersection of a rational linear subspace with the positive orthant is a 
convex rational polyhedral cone. □ 

Lemma 3.5. Suppose that there exists a vector w 6 W with every component 
positive. Then W + has full rank in W . 

Proof. The condition implies that W intersects the interior of M + [E] , which implies 
the result. □ 

Definition 3.6. A traintrack is full if it satisfies the hypothesis of Lemma T3.5I A 
traintrack is recurrent if there is a directed path from every vertex to every other 
vertex. 

Lemma 3.7. A traintrack is full if and only if each connected component is recur- 
rent. 
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Proof. The proof is obvious. □ 

An oriented immersion S 1 —> T determines a weight in Z + [E] whose coefficient 
for an edge e is the degree with which S 1 maps over e; i.e. the degree of the map 
from S 1 to the oriented circle T/(T — e). 

Definition 3.8. We denote by C(T) the space of R linear formal sums of free 
homotopy classes of oriented immersions S 1 T modulo the relation that if 
ir : L' — > L is an oriented cover of oriented 1-manifolds of degree n, then the 
composition L' — > L — » T represents n times L — > X. 

We similarly define C + (T) to be the positive orthant. Note that it does not make 
sense to define C(T)/e. Composing S 1 — > T with h gives a map S 1 — > X, and we 
therefore obtain a linear map h : G(T) — > 6 restricting to h : C + (T) — > C + . This 
map is typically neither injective nor surjective. 

Lemma 3.9. A weight w S 1< + [E] is in the image of C + (T) if and only if it is in 
kerd. 

Proof. Represent w as a finite collection of oriented edges. The condition dw = 
is equivalent to the existence of a pairing of the vertices of this collection of edges, 
where each source vertex is paired with a target vertex and conversely. Gluing 
edges according to this pairing determines an oriented f-manifold together with a 
tautological oriented immersion to T. The converse is obvious. □ 

Lemma 3.10. A weight w s 1i + [E\ n kerc? is in keriT if and only if for any 
c 6 C + (T) representing w, the image of c in C + is null-homologous. 

Proof. This follows directly from the definition □ 

In words: an integral vector w is in W + if and only if it is in the image of an 
integral c 6 C + (T) whose image in C + is null-homologous. 

3.2. Arcs, rectangles and triangles. We suppose given a traintrack T with edges 
E, vertices V and evaluation function h as in § 13.11 

Definition 3.11. An arc is an ordered pair of vertices. If a is an arc, let a denote 
the arc obtained by exchanging the order of the vertices (i.e. by reversing the 
"orientation" of a). If K.[A] denotes the R vector space spanned by arcs, let A[A] 
denote the quotient by the relation a = —a. 

Definition 3.12. A rectangle is a cyclic list (oto, eo, oti, e\) where the cti are arcs 
and the e, are edges, for which he§ = ihei, and such that ton = set and scti = tei—i, 
indices taken mod 2. 

Definition 3.13. A triangle is a cyclic list of arcs (ao, c*i, c*2) such that ton = 
scfcj+i, indices taken mod 3. 

Definition 3.14. Let P denote the set of polygons (i.e. rectangles or triangles as 
above). There are linear maps D A : R[P] -s- A[A] and D E : R[P] -)• R[E] defined 
on generators in the following way. 

If r = (otQ, eo, cti, e\) is a rectangle, D A r = ao + ct\ and Det = eo + e i- If 
t = (ao, ai, a.2) is a triangle, Dat = «o + ct\ + a-i and Det = 0. 

We also use the notation D := Da @ D E : R[P] -> A[A] ® R[E]. 
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If t = (ao,ai,a2) is a triangle, we define r := (a2,ai,cto). Note that D(t) = 
-D(r). 

Definition 3.15. Define the polygon weight space Y to be the kernel 

Y := kerD A : R[P] -> A [A] 

We define the cone of positive weights Y + := Y Dl + [P]. Furthermore, define 
y C M[P] to be the linear span of Y + . 

Remark 3.16. We do not assume that Y + has full rank in Y. 

Let E be an admissible surface, and suppose we have a map E — y X. Suppose 
further that <9E contains some components d l Yi (called ideal components) mapping 
to the basepoint of X, and that the restriction of the map to the other components 
<9 e E := <9E — <9 l E factors through an oriented immersion <9 e E — > T. Choose such 
an immersion to T . 

It is convenient to use the language of fatgraphs. 

Definition 3.17. A fatgraph is a graph together with a choice of cyclic order of 
the edges incident to each vertex. 

A fatgraph Y can be canonically fattened to an oriented compact surface Ti(Y) 
with boundary in which it sits as a spine; that is, in such a way that E(l") deforma- 
tion retracts to Y. See Figure [1] for an example. For an introduction to fatgraphs, 
see e.g. [30] § 1. We will return to fatgraphs again in the sequel, especially in § 15.31 
and § El 




Figure 1. A fatgraph and its fattening 

Now, suppose that E as above is not compressible. Then E has a fatgraph 
representation, i.e. there is some fatgraph Y so that E = E(Y"), and a simplicial 
map from Y to X for which the given map from E to X factors through the 
deformation retraction to Y. The proof of this fact is implicit in Culler [T7]; see 
also [8], especially § 4.1. 

Each edge of the fatgraph determines a rectangle, and each vertex of the fatgraph 
determines a polygon which can be decomposed (efficiently) into triangles. Call the 
result a triangulated fatgraph over T. 
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Definition 3.18. We denote by B(T) the space of K linear formal sums of trian- 
gulated fatgraphs over T modulo the relation that if it : E' — > E is an oriented 
cover of degree n (compatible with the triangulation and fatgraph structure) then 
E' represents n times E. 

We similarly define S + (T). There arc tautological maps S(T) — > S and S + (T) — > 
S + given by forgetting the fatgraph structure and the choice of oriented immersion 
9 e E — > T. Note that the function \ on S + pulls back to a well-defined function % 
on S+(T). 

Decomposing a triangulated fatgraph into rectangles and polygons determines a 
map §(T) ->R[P]. 

Lemma 3.19. A weight y S 2Z + [P] is in the image of S + (T) if and only if it is 
in kevDA- In particular, the image o/S + (T) in R[P] is exactly Y + . 

Proof. One direction is obvious, since every weight in the image of § + (T) is in 
kerl?^. Conversely, let y be even positive integral in kerD^- We consider a collec- 
tion of rectangles and triangles determined by y, and try to glue boundary arcs in 
pairs. 

By the definition of the vector space A [A] and the fact that y is even, the arcs 
can be paired and the polygons glued up to make a surface. Each rectangle admits 
a tautological map to X coming from the edge labels, in such a way that the 
boundary arcs map to the basepoint. Map each triangle to X by the constant map. 
The result is a compact oriented surface E mapping to X. The components <9 e E 
admit a canonical lift to T determined by the collection of edges. This completes 
the proof. □ 

Given E — > X representing a class in S + (T), the boundary <9 e E — > T represents 
a class in C + (T). This defines a linear map d : § + (T) —> C + (T) compatible with 
d : S+ -> C+. 

Lemma 3.20. The image ofY + under De is W + . Moreover, ifT is full, then the 
image of Y under De is W. 

Proof. Every y G Y + is in the image of s G S + (T). Evidently Dsy is equal to 
the image of ds G C + (T) and is therefore in W + by Lemma T3. 101 This proves one 
direction. 

Conversely, every rational w G W + is in the image of some c' G C + (T) whose 
image c G C + is in B + . Therefore there is s G S + with ds = c and after compressing 
s if necessary, we can represent s by a triangulated fatgraph. Because ds is in the 
image of C + (T), this triangulated fatgraph determines s' G § + (T) with ds' = c', 
and therefore if y G Y + is the image of s' we have P>eV — w- This proves the first 
claim. 

To prove the second claim, observe that if T is full, then every w G W is of the 
form w + — w~ for G W + . Choose y ± G Y + mapping to under De- Then 
DE{y + -y~)=w. □ 

3.3. Euler characteristic. 

Definition 3.21. Define \ : M + [_P] — > M to be on rectangles, and —1/2 on 
triangles, and extend by linearity. 

Lemma 3.22. Suppose y G Y + and let s G S + (T) map to y. Then x(s) — x(y). 
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Proof. By linearity, it suffices to prove this on generators. So let £ be a triangulated 
fatgraph over T. This fatgraph deformation retracts onto a spine, composed of an 
edge for each rectangle, and a tripod for each triangle. Edges and tripods are 
glued along 2-valent vertices (corresponding to paired arcs in S) . Consequently the 
resulting graph is trivalent, with one vertex for each triangle. For a trivalent graph 
with v vertices and e edges, e = 3i>/2 and \ = v — e ~ v ~ Zv/2 — —v/2. □ 

Putting this together with our earlier observation that \ '■ & + (T) — > R factors 
through S + (T) — > S + , we obtain a commutative diagram 

S+(T) ► S+ 

x 

Y+ — > R 

3.4. Linear programming and rationality. For w € W + , let G + (w) C C + (T) 
denote its preimage in C + (T). This is a convex set. Since the image of C + (i/i) is 
contained in B + , the function scl can be pulled back to C + (w). 

Definition 3.23. For w G W + , define scl(iy) to be the infimum of scl(c) over all 

c g e+(w). 

Theorem 3.24 (Traintrack Rationality). For w G W + , there is a formula 

scl(u;) = inf{— x(y)/2 : y G Y + and De(u) = w} 

Moreover, the infimum is achieved, and if w is rational, the infimum is achieved at 
a rational y. Furthermore, the function scl(w) varies piecewise rational linearly on 
W+. 

Proof. Since W + and Y + are rational polyhedral cones, the infimum is achieved, 
and if w is rational, it is achieved at a rational y. 

The function \ on & + {T) pushes forward to x on S + , and also pushes forward to 
X on Y + . Since every surface can be compressed (thereby reducing —\) until it is 
represented by a fatgraph, we conclude that for any homologically trivial c G C + , 

scl(c) = inf{-x(s)/2 : s G S+(T) and ds = c} 

Taking a further infimum over all c G C + (T) mapping to it; we obtain the desired 
result. 

The last assertion is just the observation that the output of linear programming 
with fixed variables depends piecewise rational linearly on the constraints. □ 

Note that the function — x/2 on Y + extends in an obvious way to a pseudonorm 
|| • || on R[F] taking the value on rectangles, and 1/4 on triangles. With this 
definition, scl(w) = inf \\y\\ over all y G Y + with Dsiy) = w. 

4. Counting functions and posimorphisms 

In this section we consider a canonical family of traintracks associated to a free 
group F and a choice of free generating set, namely the (so-called) verbal traintracks. 
Dual to the weights on a verbal traintrack are the counting functions, and the 
antisymmetric functions are exactly the (homogeneous) counting quasimorphisms 
considered by Rhemtulla [32] and Brooks [5]. 
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We show that the system of (normed) weight cones W/~ on all the verbal train- 
tracks together faithfully recover the geometry of the (infinite dimensional) normed 
cone B + . Duality (i.e. the Hahn-Banach theorem) applied to these finite dimen- 
sional cones shows that for each b e B + there is a rational counting function <p 
which, thought of as a linear function on £?+, has a level set that is a supporting 
hyperplane for the scl ball at &/scl(&). 

4.1. Formalism. Suppose we have vector spaces Y, W and a surjective linear map 
D :Y — > W. A pseudonorm || • || on Y determines a pseudonorm || • || on W by 

Hl= inf >|| 

y eD- 1 (w) 

Suppose further that we have convex cones Y + , W + in Y, W of full dimension, so 
that DY + = W + . There is a different pseudonorm || • || + on W + defined by 

IMI + = m f \\y\\ 

yeD- 1 (w)nY+ 

Since D^ 1 (w) n Y + is a subset of there is an inequality 

\\w\\+>\\ W \\ 

but equality will not hold in general. Given w, the restriction of || • || to the affine 
subspacc D^ 1 (w) is convex, so if the infimum is not realized on the convex subset 
Y + HD^ 1 (w), the infimum over this subset is achieved on the boundary. Conversely, 
if ||to|| + is achieved by some y in the interior of Y + , then ||i«|| + = \\w\\. 

4.2. Verbal traintrack. 

Definition 4.1. Fix some length £. Define the verbal traintrack of width i (denoted 
Tg) to be the following directed graph. 

(1) The vertices are the reduced words in F of length I — 1. 

(2) The edges are the reduced words in F of length i . 

(3) For an edge e define se and te to be the prefix and suffix of e of length 

(4) For an edge e define he to be the first letter of e. 

Note that Ti is full for any £. For the remainder of this section we fix a width t 
and work in the associated traintrack Te. 

Remark 4.2. Defining he to be the first letter of e is just a convention; if we fix some 
i < £ and define he to be the ith letter of e, we obtain isomorphic normed spaces and 
cones, and moving back and forth between different conventions is straightforward 
(if somewhat messy). 

We define £ to be the vector in R[E] with coefficient 1 on every edge, and ft the 
vector in M.[P] with coefficient 1 on every rectangle. 

Lemma 4.3. £ e W + and ft e Y+ , and D E 5i = (\S\ - l)^ 1 ^ 

Proof. Let e be an edge, with label he. There is a bijection between the set of e' 
with he' — ihe and the set of rectangles r with D E r — e + e'. By the definition of 
Te, given e the number of such e' is (|5| — l)^ -1 where S is the symmetric generating 
set for F. □ 
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We use the notation || • || and || • || + from § 14.11 for the pseudonorms on W and 
W + associated to the pseudonorm on Y taking the value 1/4 on triangles and 
on rectangles. With this notation, \\w\\ + = scl(u;) for w G W + and therefore 
scl(iy) > \\w\\. Furthermore, scl(£) = ||£|| = 0. If we need to stress the dependence 
of || • || and || • || + on £, we write \\w\\i and ||w||^~. 

Lemma 4.4 (formula for norm). For any w G W + there is a formula \\w\\ — 
inf t scl(w + t&) over all t > 0. This infimum is achieved for all t > t(w). Moreover, 
t(w) depends piecewise rational linearly on w. 

Proof. First of all, 

HI < H + *£|| + II -*£|| = H + *£|| < sci(w + <£) 

so one direction is proved. 

Conversely, Let y be a vector in D^}(w), and write y — y + — y~ for y^ G Y + 
where we assume y + and y~ have disjoint support in P. Let x be the part of y~ 
supported in triangles, and let x be obtained by replacing each t in x with r. Then 
D{x + x) — so we can write y' — (y + + x) — (y~ — x) with ||y'|| = ||y|| and 
De(u') — V- We must be careful, since y + + x is not necessarily in Y + , although it 
is in R+[P]. 

Now, since y~ — x consists entirely of rectangles, there is z € M. + [P] so that 
y~ — x+z — t!R for some positive t. Hence we can write y" = (y + +x+z) — x+z) 
where now (y + + x + z) and (y~ —x + z) are in Y" + , and \\y"\\ = \\y'\\ — \\y\\- Since 
Ds(y + +x + z) = w + t£ we have \\y\\ = \\y"\\ > scl(w + t£) and we deduce 
H|| = inf t scl(io + t£). 

By construction, scl is piecewise rational linear and convex, and therefore the 
derivative ^scl(ui + t£) is locally constant with finitely many jumps, and is strictly 
increasing at those jumps. On the other hand, < sc\(w + t£) < sc\(w) for all 
t € [0,oo), so scl(it) + t&) is eventually constant and equal to its infimum. If w is 
rational, the infimal t with this property is rational. □ 

4.3. Canonical immersion. Let Ti denote the verbal traintrack of length £. There 
is a canonical map $^ : C -> ker<9 C WL[E] defined on a cyclicly reduced word g 
by reading the subwords of length £ of the bi-infinite power g with initial letter 
contained in a fixed fundamental domain. This map restricts to surjections : 
B -> W and §i : B+ -> W+ . 

Note for any b G B + we have inequalities 

||^(&)|| < \\®l{b)\\ + <scl(6) 
On the other hand, we have the following estimate. 

Lemma 4.5. Let b G B + , and suppose that every primitive word in the support of 
b has length at most £ — 1. Then ||<Iy(&)|| + = scl(&). 

Proof. The condition implies that the support of $f (6) is an embedded 1-manifold 
in Ti, whose components correspond to the components in the support of b. Con- 
sequently $^ 1< I > ^(6) (~1 B + = b and the lemma follows from Theorem 13.241 □ 

We make the following definition: 

Definition 4.6. Define e : R[E] —> R[E] on generators by e — > — e^ 1 . 

This might appear to cause confusion with e : B — > B defined earlier; however, 
no confusion should arise in practice because of the following lemma: 
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Lemma 4.7. For any b £ B we have e$>ib = <&ieb. Moreover, e8.g = —£.£. 

Proof. These facts are immediate from the definitions. □ 

4.4. Posimorphisms. We briefly interrupt the discussion to recall some elements 
of the dual theory to scl, which is to say, the theory of quasimorphisms. 

Definition 4.8. Let G be a group. A homogeneous quasimorphism is a function 
<fi : G — > R for which there is a least non-negative real number D{(j>) (called the 
defect) so that 

(1) for any jeG and any n £ Z there is equality <j){g n ) = n<j)(g); and 

(2) for any g,h £ G there is an inequality \4>(gh) — <f>(g) — 4>(h)\ < D{<j>). 

The R vector space of homogeneous quasimorphisms is denoted Q(G), or just Q 
if G is understood. 

A homogeneous quasimorphism is necessarily a class function; together with the 
homogeneity condition, we see that a homogeneous quasimorphism descends to an 
R- linear function : B/e —> M. Any R-linear function <f> : B/e — >• R extends to 
C/e — > R, and any two extensions differ by an element of H 1 (G) . We can give a sim- 
ple alternate characterization of homogeneous quasimorphisms which underscores 
its relationship to scl. 

Lemma 4.9. An R-linear function (f> : B/e — > R is (the restriction of) a homoge- 
neous quasimorphism with defect < D((f>) if and only if for every admissible surface 
S there is an inequality 

<KdS)<- x -(S)-D(<i>) 

Equivalently, for any b £ B/e, 

<f>{b) < scl(6) • 2D{4>) 

Proof. The equivalence of the two inequalities follows from Proposition 12.121 The 
first inequality is vacuous when S is closed or a disk. If S is an annulus, cf>(dS) = 
follows from homogeneity, since (/>(&" x ) = —0(6). In every other case, S can be 
cut up into thrice-punctured spheres. If S = Si U S% glued up along a (with 
opposite orientations) then (f>{dS\ + dS^) = <fr(dS) (because 0(a _1 ) = —(f>{a) by 
homogeneity), and the lemma is proved. □ 

The second inequality in the statement of the lemma shows that a homogeneous 
quasimorphism is precisely an element of the dual of B/e with its scl pseudonorm, 
and that 2D is the operator norm. This gives a remarkably short proof of the 
following well-known duality theorem: 

Theorem 4.10 (Generalized Bavard Duality; [3j|8]). Let G be any group. Then 
for any b £ B/e, there is equality 

scl(b) =sup4>(b)/2D{4>) 

where the supremum is taken over all homogeneous quasimorphisms <j). 

On the other hand, working with B + lets us consider a more general class of 
dual functions. Motivated by Lemma T4.91 we make the following definition: 



16 



DANNY CALEGARI AND ALDEN WALKER 



Definition 4.11. Let G be a group. An R + -linear function <j> : B + — > K is a 

posimorphism if there is some least non- negative number D{4>) (called the defect) 
so that for any b G B + there is an inequality 

<t>(b) < bc1(6) • 2D((f>) 

We denote the space of posimorphisms by P(G), or just P if G is understood. 
Note that P is an R + -module (i.e. a convex cone), and is not in general a vector 
space. The Hahn-Banach theorem implies that for any b g i? there is some posi- 
morphism <f> with -D(0) = 1 and (f>(b) = 2scl(6); we call any such <p extremal for 
b. In the next section we will see that the theory of traintracks can be used to 
construct an explicit extremal posimorphism, dual to any b € B + (F) for F a free 
group. 

Remark 4.12. The relationship between quasimorphisms and posimorphisms is sim- 
ilar to the relationship between harmonic and subharmonic functions, or between 
martingales and submartingales. 

4.5. Counting functions. In this subsection we introduce the class of counting 
functions. The subclass consisting of antisymmetric counting functions (discussed 
in § 14. 6|) are the most well-known class of quasimorphisms on free groups. These 
counting quasimorphisms were introduced by Rhemtulla [32], and further devel- 
oped by Brooks [5 . Similar functions were introduced by Epstein- Fujiwara |19) . 
although with a subtly different definition. The Brooks- Rhemtulla quasimorphisms 
are sometimes called "big counting quasimorphisms" whereas the Epstein-Fujiwara 
quasimorphisms called "small counting quasimorphisms" if it is necessary to dis- 
tinguish them. 

The following problem is central, and well-known: 

Question 4.13 (Counting Quasimorphism Question). Does every chain in a free 
group admit an extremal counting quasimorphism? 

See e.g. Grigorchuk [23J for a partial discussion. We show that by changing the 
problem slightly, we can give a positive answer, in the form of the Counting Duality 
theorem (below) , which says that every element in a free group admits an extremal 
counting posimorphism. Our analysis further clarifies the difference between posi- 
morphisms and quasimorphisms, and leads to a suggestive thermodynamic model 
of the potential difference (see Remark |4.25[) . 

We begin with the definition of counting functions. 

Definition 4.14 (Counting function). If a is a reduced (noncyclic) word in F, the 
counting function C" T (-) counts the number of copies of a in a cyclically reduced 
word. Hence for g a reduced cyclic word, C a (g) counts the number of copies of a 
in g. 

The function C a extends by linearity to C a : C — > M. We let C\ denote the 
linear span of the space of counting functions on all words a of length £, and we let 
C* ount denote the union over all I. There is a map from R[F] to C* ount taking a 
to G a and extending by linearity, but this map is not injective. For example, every 
cyclically reduced word is either a power of a, or else each string of consecutive 
a's starts and ends with one of b or B; hence C ab + C aB = C ba + C Ba . In other 
words, the representation of some 4> G C\ as an expression of the form ^to-C" 7 is 
not unique. 
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The next lemma shows that any counting function has a representative in which 
all the words a in the support can be taken to have some fixed length (equal to the 
suprcmum of the lengths of the words appearing in any representative). 

Lemma 4.15. If £ < £' then C\ is a subspace ofCp. 

Proof. For every reduced word a' of length £' let pre^(cr') denote the prefix of length 
£. Then 

C a -> (2|5| - iy- 1 ' C °' 

prc £ (ct')=<7 

extends to an identification of C% with a subspace of C\, . □ 

It is natural to think that there should be a strong connection between counting 
functions and verbal traintracks. This is summarized in the following lemma. 

Lemma 4.16. C\ is the dual of and Cg/H 1 is the dual of ${B = We. 

Proof. It is obvious from the definition that C\ factors through Conversely, 
any linear functional on = ker9 C M.[E] extends to a linear functional on R[E] 
which can be thought of as an clement of C* t in an obvious way. □ 

The next lemma illustrates the relationship between counting functions and posi- 
morphisms. 

Lemma 4.17. A counting function <f> G CI is a posimorphism (with some defect) 
if and only if <f)(c + c _1 ) < for all c G C. 

Proof. The condition is necessary, since c + c _1 G B, and scl(c + c _1 ) = for 
any c G C. Conversely, let b G B be an arbitrary rational chain, and let S be 
an extremal fatgraph for b, so that dT, represents an oriented cover of b of some 
degree. By linearity it will suffice to consider the case 9S = 6, and we must prove 
that (p(b) < — x(S) • 2D((j)) for some constant D{cf)) depending only on <f>. If some 
component of S is an annulus, its boundary is of the form c+c -1 , so without loss of 
generality we can assume that no component of £ is an annulus, and every vertex 
v of the fatgraph has valence(w) > 3. 

We suppose that we an write <fi = J £2t i C ai for some real numbers t i7 where Ui 
ranges over Fg. We let ej, ej 1 denote the set of (oriented) labels appearing as edges 
of X; i.e. the collection of cyclic words b can be cut up at finitely many points into 
the collection of ordinary words ej, ej . It makes sense to evaluate a counting 
function on an ordinary (non-cyclic) word, by counting copies of subwords <Ji . The 
only copies of Oi that occur in dT. but not in some ej or ej 1 are those that map 
over some vertex of the fatgraph. Hence 

\m - 5>(e,) + <P{e^))\ < ]T valence^) • t • £ %\ 

j v i 

On the other hand, for each j the difference of 4>(ej) as an ordinary word and as a 
cyclic word is also bounded by £ ■ J2 i \U\, and therefore we can estimate 

<t>{b) < 2 Y valcnce(w) • £ ■ ^ \U\ 

v i 

Since every vertex v has valence at least 3, and since — x^(S) = J2 v ( vs ^ ence ( v ) ~ 
2)/2, it follows that we can estimate 

0(6)<-12 X -(£)-*-5>il 
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so that 4> is a posimorphism with D(4>) < 12£ ■ J^. |t$|. □ 

Remark 4.18. We can give a simpler, although less sell-contained or direct prool ol 
Lemma 14.171 using the theory ol verbal traintracks already developed. Evidently, 
4>(b) = 4>{$i{b)) lor any b G B, so to show that <j> is a posimorphism, it suffices to 
show that 4>(w) < scl(iu) • 2Z?(</>) for all w G and lor some D(<p). However, 
since is finite dimensional and <j> is linear, it suffices to have 4>{w) < whenever 
scl(w) = 0. Now, if scl(ui) = 0, there is some b € <&J 1 (w) with scl(6), and every 
such b is of the form c + c _1 . The proof follows. 

Finally we come to the counting duality theorem, which explicitly constructs a 
rational extremal counting posimorphism dual to any b G B. 

Theorem 4.19 (Counting Duality). For any b G B + there is an I and a rational 
linear function 4> G C\ '/ ' H 1 so that (j){b r ) < \\<$> e {b')\\ + < scl(b') for all b' G B+ and 
<p(b) = scl(6). In other words, ip is an extremal posimorphism for b. 

Proof. We take £ larger than any word in the support of 6, so that ||$£(6)|| + = scl(6) 
by Lemma 14.51 Consider the unit ball in the pseudo-norm || • || + on . This is 
a closed, convex, rational polyhedron. Consequently there is a rational supporting 
hyperplane which intersects the boundary of the unit ball along the ray E + &e(b). 
Such a hyperplane is the level set cf> = 1 for some <j> G C* t , by Lemma 14.161 By 
construction, 0(6') < ||^(6')|| + < scl(6') for all b' G B+ , but cj>{b) = ||$*(&)||+ = 
scl(6). □ 

4.6. Antisymmetrization. We now spell out the relationship between the norm 
|| • ||^ and the space ol (homogeneous) counting quasimorphisms, which are precisely 
the antisymmetric counting functions. 

Definition 4.20. A counting function </) is antisymmetric if (^(g^ 1 ) = —4>{g) for 
all g. An antisymmetric counting function is called a counting quasimorphism. The 
space of counting quasimorphisms is denoted Qcount, and its intersection with C\ 
is denoted Qi. 

Lemma 4.21. For any b G B there is an equality \\$tb\\g. = ^\\^e(b + eb)\\e. 

Proof. For any b G B + the chain b — eb G B + is the union of two copies of b with 
opposite orientations. Evidently there are equalities 

= scl(6-e6) - ||^(6 -e6)||+ = \\* t (b-d>)\\ t 

for any I. By linearity, the same is true for any b G B. We estimate 

+ efo))U = + &)\V + WMb - eb)\U 

> \\§ t (b + eb+b-A)\\t = ||*<(26)||< 
= \\$ t (2b)\\t+\\$i(-b+eb)\\t 

> ||*/(26-6 + e6)|| / = ||$ / (6 + e6)|| / 

Each inequality above is therefore an equality, and the lemma is proved. □ 

It follows that ||$f6||f is well-defined on B/e. This is significant, because is 
not well-defined on B/e. 

Definition 4.22. Define V e : C -> R[E] by *^(c) = i$ £ (c + ec). 
Note that ||$f(6)||^ = ||^(&)|k for any b G B, by Lemma|4~2U 
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Lemma 4.23. Qe is the dual of^eC and Qe/H 1 is the dual of^eB = (id + e)We- 

Proof. This follows just as in the proof of Lemma 14.161 together with the obser- 
vation that the adjoint of c — > |(c + ec) on C is the "antisymmetrization map" 

e* :C G ^\{C a -G a ^) o^C* t . □ 

The retraction We — > + e)We factors through the quotient We — > Wi/e, and 
shows that this quotient has a section. We have shown that || • \\e descends to a 
pseudo-norm on We/e, and that the dual of We/e is isomorphic to Qe/H 1 by the 
adjoint of \t^. 

Theorem 4.24 (Counting Quasimorphism Duality). For any £ and any b £ B/e, 
there is an equality 

m(b)\u= sup 

<t>eQe/m 2D(0) 

Proof. The map ^e from B/e to We/e pulls back the pseudonorm || • \\e to a 
pseudonorm on B/e which satisfies |<5>(&)||^ < scl(6). Moreover, for every w € W/e 
there is some b 1 6 B/e with &e(b') = w and \\w\\e = scl(fr'). 

Choose a supporting hyperplane for the unit ball in the pseudonorm || • \\e on 
W/e that intersects the boundary of the ball at ^>e(b). Let b' £ B/e be such that 
V^(6') = ^ e (b) and scl(6') = \\^e{b)\\e- Thinking of Qe/H 1 as the dual of W/e, this 
supporting hyperplane is the level set of some 4> S Qe/H 1 . We have <fr(b') — scl(6') 
and \<p(b")\ = \\y e {b")\\ e < sc\(b") for every b" e B/e. Since sc\(gh - g - h) < 1/2 
for all 5, /i we have D(<f>) < 1/2. On the other hand, 10(^/1 ~ g — h)\ = 1/2 whenever 
the thrice-punctured sphere bounded by g/i — g—h immerses in an extremal surface 
for b'. □ 



Compare with the Generalized Bavard Duality Theorem (i.e. Theorem I4.10|) . 
We deduce that a chain b admits an extremal counting quasimorphism if and only 
if ll&l^ = H&H+ for some I. 

Remark 4.25. For b 6 B/e we may suggestively refer to the quantity sup^ ||^(6)||^ 
as renormalized scl, in contrast to bare scl (i.e. to scl(6) itself). As explained in 
[8], § 4.3.6 there is a thermodynamic interpretation of scl, where labeled strands 
combine into configurations (i.e. fatgraphs) in such a way as to minimize energy 
(as measured by — x). If 6-strands are inserted into a "Dirac sea" of free strands, 
the renormalized energy may be less, because fatgraphs may form that pair b with 
collections of free strands that are themselves capable of being paired into parallel 
strands at increasing scales t — > oo. 



5. The geometry of Qe 

The theory of counting quasimorphisms as described in § |4] has the potential to 
build a bridge between topology and combinatorics. In this section we describe the 
geometry of Qe/H 1 as a normed space with its defect norm, for free groups of finite 
rank, with special attention to the case of £ = 2. 



5.1. Fatgraphs, ends, fat folding. In the sequel we assume the reader is familiar 
with Stallings' theory of graphs and graph foldings; see [36] . 
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Definition 5.1. A fatgraph with ends is a graph with some 1-valent vertices (the 
ends). The fattening of a fatgraph with ends is a surface with corners; there is one 
edge of the boundary associated to each end of the graph; these edges are called 
infinitesimal edges. 

Definition 5.2. Let A be a graph. A fatgraph over AT is a fatgraph Y together 
with a simplicial map Y — ¥ X. A fatgraph over X is folded if the induced map 
dYj(Y) — » X is an immersion. 

The most important case to consider is when A is a standard rose for a free group 
F with a fixed (symmetric) generating set S. A fatgraph Y over the standard rose 
is also called a labeled fatgraph. We can think of it as being given by assigning a 
reduced word to each oriented edge of Y. The fatgraph Y is folded if the induced 
labels on c?E(Y) are cyclically reduced. 

A simplicial map Y — > X between graphs which is not an immersion (i.e. which 
is not locally injective) can be iteratively Stallings folded, by identifying pairs of 
oriented edges of Y with the same initial vertex which map to the same oriented 
edge of A. If Y is a fatgraph, a Stallings folding is a fat folding if the edges to be 
identified are adjacent in the cyclic order at the initial vertex. The result of a fat 
folding is a new fatgraph. A sequence of fat foldings is elementary if it induces a 
homotopy equivalence of fatgraphs. Every fatgraph Y over a graph A may be fat 
folded to produce a canonical folded fatgraph Y' over A. 

If F is a free group with symmetric generating set S, and A is a standard rose 
for F, a fatgraph Y over A is also called a labeled fatgraph. An oriented edge of Y 
determines a word in S. If Y is folded (as a fatgraph), the resulting labels on t?E(Y) 
are cyclically reduced. If Y is a fatgraph with ends, <9£(Y) is labeled with reduced 
cyclic words, and reduced non-cyclic words starting and ending at an infinitesimal 
edge. 

Now, suppose Y',Y are two folded fatgraphs over A. We say that Y' immerses 
in Y as a fatgraph if there is a fatgraph Y" that immerses in Y in the usual sense 
(i.e. as an immersion of graphs, compatibly with the cyclic orders at each vertex), 
and Y' is obtained by elementary fat folding Y". Note that this implies that Y' and 
Y" are homotopic, and that the surface £(Y") which is homeomorphic to S(F') 
immerses in £(Y) (in the usual sense). The point of this definition is that even 
though Y might be folded over A as a fatgraph, it might not be folded as an ordinary 
graph, and therefore there might be subfatgraphs which are not themselves folded. 

The following theorem is a restatement of Theorem B from [TO] : 

Theorem 5.3. Let V be a chain in B/e, and let X be a labeled fatgraph so that 
<9£(A) = r, and E(A) is an extremal surface for T. Let X' be any fatgraph that 
immerses in X , and let V = <9E(A'). Then E(A') is an extremal surface for V . 
Moreover, if <f> is any guasimorphism extremal for T, then <f> is extremal for V . 

5.2. Tripods. A tripod T is a fatgraph with three ends; its underlying graph is the 
cone on three (cyclically ordered) points. We consider tripods which are folded over 
the standard rose. Hence if x, y, z are the three labels on the edges of the tripod 
(in positive cyclic order) then 9£(T) = Xy + Yz + Zx. A tripod has length £ — 1 
if the words x, y, z all have length I — 1. 

Now, suppose <f> € Qe is a counting quasimorphism, and let T XtVtZ be a tripod of 
length £ — 1. We define 

4>(T x , y , z ) := 0(aS(T x , v ,,)) = <j>(Xy + Yz + Zx) 
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Lemma 5.4. Let 4> 6 Qg. Then D(<j>) — max z ^ iZ 2(f>(T x ,y,z) where the supremum 
is taken over all tripods of length £ — 1 . 

Proof. By counting duality (i.e. Theorem 14 .241) there is some T with cf>(T)/2D((j)) = 
scl(r). After replacing T by a finite cover if necessary, there is a labeled fatgraph 
X such that dT^(X) — T, and scl(r) = — x(X)/2. After replacing by a multiple 
if necessary we can assume D((f) = 1, so that <fi(T) = ~x(X). Now, let T be 
any tripod of length £ — 1 that immerses in X. We can build a new fatgraph Y 
homotopic to a pair of pants that immerses in X, from two copies of T joined by 
three edges of length > 2£. By Theorem 15.31 and by construction, 

1/2 = - X (Y)/2 = scl(aS(F)) = <P(dZ(Y))/2 = J>(T) 

This proves that D((f>) > m&XT 2(f>(T) . Conversely, if T is any tripod of length I— 1, 
we can build a fatgraph Y from two copies of T joined by edges of length > 2£. 
Then 

<f>(T) = <j>{dY,{Y))/2 < scl(9S(r)) < -xCn/2 = 1/2 

□ 

We can therefore define for each T of length £ — 1 the tripod hyperplane ttt to 
be the affine subspace of Qg consisting of cf> with <fi(T) — 1/2. By the definition of 
defect, and Lemma T5. 41 we deduce the following: 

Lemma 5.5. The tripod hyperplanes ttt as T ranges over the tripods of length £—1 
are a complete set of supporting hyperplanes for the unit ball in Qi/H . 

For each tripod T of length £ — 1, define the tripod face ttt to be the intersection 
of ttt with the boundary of the unit ball in Qi/H 1 . 

It follows in particular that every codimension one face of the unit ball is a tripod 
face. However, it is by no means clear which tripods correspond to codimension 
one faces of the unit ball. In the next few sections we will show that every ttt is 
nonempty, and in the special case of £ = 2 that every ttt is a codimension 1 face. 

Corollary 5.6. Let Y G B/e be arbitrary, and let X be a fatgraph whose fattening 
is an extremal surface for T. Fix some £, let Ti denote the set of tripods of length 
£ — 1 that immerse in X , let tti be the associated tripod hyperplanes, and let ixi be 
the tripod faces. The set of <fi £ Qi/H 1 extremal for T is precisely equal to the cone 
on a i Ti l . 

Proof. This follows immediately from Theorem 15.31 and Lemma 15 .51 modulo the 
following claim. We claim that we can find a collection {ij} of — 2\(X) tripods Ti. 
immersing in X so that ^(T) = ■ ^e(Ti j ) (see Definition 14.221 for the definition 
of ^e). Assuming this claim, the converse is proved as follows. Any (f> S DiiTi 
satisfies D(cf>) = 1 and 4>(Ti) — 1/2 for every tripod immersing in X. In particular, 
if such a cj> exists, then <f>(T)/2D{4>) = Ej^( T ij)/ 2 = -x(X)/2 = scl(r) so <j> is 
extremal for T. 

The claim is proved by a standard argument in hyperbolic geometry. Pick a 
hyperbolic metric on S(A) with geodesic boundary, and fix an ideal triangulation 
of this surface in which the ideal triangles spin around the boundary. Now deform 
the metric so that the length function (projectively) degenerates to the length 
function on the graph X. After rescaling the metric, the diameter of the thick 
part of each ideal triangle goes uniformly to zero, and the center of this thick part 
converges to one of the vertices of X. Cut off the ends of each triangle so that the 
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length of each end is equal to £ — 1 in the rescaled metric. Each triangle degenerates 
to a tripod. This collection of tripods evidently satisfies the claim. □ 

We call a collection Ti of tripods of length I — 1 compatible if the associated 
tripod faces iti have a nonempty intersection in the boundary of the unit ball in 
Qi/H 1 . Corollary 15.61 implies that any fatgraph built from copies of a compatible 
collection of tripods will be extremal for its boundary. 

5.3. Basic realizations and rotation quasimorphisms. Fix a free group F of 
finite rank, and a finite symmetric generating set S. If the rank of F is k, then 
|5| = 2k. We fix a standard rose X, i.e. a graph with one vertex * and k edges, and 
an identification of F with w± {X, *) in such a way that the elements of S correspond 
to the oriented edges of X . 

A cyclic ordering O on S gives X the structure of a fatgraph with fattening Eo 
(see Definition 13. 17[) and thereby determines an isomorphism from F to 7Ti(Eo) up 
to conjugacy. We call such an isomorphism a basic realization with basis S. 

The embedding of X in E determines an embedding of the universal cover X in 
E, which is homeomorphic to the (oriented) plane, and this embedding determines 
a cyclic ordering on £(X), the space of ends of X. Since H 2 {F) = 0, this action on 
a cyclically ordered set determines a rotation quasimorphism roto £ Q/H 1 ', see [8] 
§ 2.3.3 for details. 

In fact, for the rotation quasimorphisms arising from basic realizations as above, 
it turns out that roto £ Q2/FI 1 , and we can give an explicit formula for a repre- 
sentative. Recall that we abbreviate inverse of a generator (given as a lower case 
letter) by capitalization, so that X means a;" 1 , and so on. 

Theorem 5.7 (Rotation number formula). Let O be a cyclic order on S. For each 
pair of letters x,y £ S with y not equal to X , there is a linear order 015 — X on 
S — X , and we let n xy denote the (signed) distance from x to y in this linear order. 
Then there is a formula 

See Thm. 4.76 for a proof. 

Remark 5.8. Some version of this formula was originally discovered by Chilling- 
worth [15 , although it was expressed in quite different language, and he did not 
make the connection to the theory of quasimorphisms. 

In |10j it is proved that rotation quasimorphisms associated to realizations are 
extremal vertices in the space of all homogeneous quasimorphisms; i.e. that they 
are dual to codimension one faces of the scl norm on B/e. In particular it follows 
that they determine vertices of the unit ball in Qi/H 1 for any £ > 2. 

We now relate rotation quasimorphisms to tripod faces. 

Lemma 5.9. Let T be a tripod face of length £ — 1, and roto a rotation quasimor- 
phism. Let x,y,z be the initial letters of the edge labels on T. Then ttt contains 
roto if and only if x,y,z are positively circularly ordered in O. 

Proof. This follows immediately from the definition, and from the formula in The- 
orem 15.71 Alternately, if Y is a fatgraph made from two copies of T, then either 
Y or Y with the opposite orientation immerses in X with the fatgraph structure 
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associated to O. The surface T,{X) is extremal for dT,(X), and roto is extremal for 
d~E(X). Now apply Theorem O □ 

5.4. The geometry of Qi- Let A denote the complete simplex on S, a symmetric 
free generating set for F. Let C* denote the complex of simplicial (real-valued) 
cochains on A. For each i, the symmetric group Si+i acts on C l by permuting the 
vertices of an i-simplex; call a cochain antisymmetric if it transforms by the sign 
character under the action of Si+\. The subcomplex C* of antisymmetric cochains 
includes into C* , and this inclusion is a homotopy equivalence (over Q). Note that 
every element of C* vanishes on degenerate simplices. 

Since A is contractible, we have Z l = B % for all i > 0; i.e. every cocycle is a 
coboundary. 

Lemma 5.10. There is an isomorphism of vector spaces Z 2 = Q2/H 1 . Moreover, 
the defect norm on Q2/H 1 is twice the L°° norm on Z 2 . 

Proof. First of all, note that an element of Q2/H 1 is determined by its values on 
tripods of length 1. This follows from Lemma 15.51 since the tripod hyperplanes are 
a complete set of supporting hyperplanes, and since the defect norm on Q2/H 1 is 
nondegenerate. 

An ordered pair of elements s,t 6 S determines a word s~ l t of length 2 in 
F. Insisting that s,t are distinct is equivalent to insisting that s _1 i is reduced. 
Therefore an element of Q2 (i.e. an antisymmetric counting function of length 2) is 
exactly the same thing as an antisymmetric simplicial 1-cochain, i.e. an element of 
C . If is a 1-cochain, then S(j>(s, t, u) = <fi(s, t) + <fr(t, u) + <fi(u, s). If we think of 
s,t,u as the ordered labels on a tripod T of length 1, then 8cf>(s,t,u) — <p(dY,(T)) 
where on the right hand side we think of <j> as a counting quasimorphism. Thus the 
image of <j> in Q 2 / H 1 is precisely determined by its image in Z 2 , and since B 2 = Z 2 , 
every element of Q2/ H 1 arises in this way. Finally, equating defect with twice the 
L°° norm is just a restatement of Lemma 15.41 in this context. □ 

From this we reduce a strong structure theorem for the unit ball in Q2/H 1 : 

Theorem 5.11 (Unit Ball). The codimension 1 faces of the unit ball in Q2/FI 1 are 
exactly the set of length 1 tripod faces. The symmetric group S2k o,cts on this set 
by permutation of edge labels, and the resulting action on Q2/ H 1 is by isometries. 
As a representation of S2k, the space Q2/F1 1 is isomorphic to A 2 V where V is the 
standard (2k — 1) -dimensional representation. 

Proof. The group S^fc acts on A by permutations, and therefore on Q2/FI 1 by 
isometries, by Lemma 15.101 Now, if V denotes the (coordinate) permutation rep- 
resentation of S2k on R 2fc , there is an obvious identification of C° with V and C 1 
with A 2 V. Then Z 2 = ^/SC = A 2 V where V is the complement in V of the 
constant vector, which is the kernel of S : C° — > C 1 . This shows that Q2/H 1 is 
isomorphic to A 2 V as a vector space, and that the action of S^k is irreducible. It 
follows that the rotation quasimorphisms roto span Q2/H 1 . Since every tripod 
face 7Tt contains exactly half of the roto by Lemma 1531 it follows that every tripod 
face is a codimension one face. □ 

Remark 5.12. The group Out(F) acts by isometries on Q/H 1 , and the stabilizer of 
a symmetric generating set S therefore acts by isometries on Q2/ 1 H l . This stabilizer 
is the group Z/2Z fe x Sk, where Sk acts on F by permuting the generators, and the 
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Z/2Z factors act by switching a generator with its inverse. Note that this group 
acts reducibly on Q2/H 1 in the case F has rank 2. It is therefore quite surprising 
that Q2/H 1 admits the full group S2k as isometries; this seems quite mysterious 
from the topological perspective, since such permutations do not even preserve the 
topological type of a fattening! 

Remark 5.13. Since Q2/H 1 includes as a subspace of Qi/H l for each I > 2, and 
since the rotation quasimorphisms are vertices in each Qg/ H , it follows that every 
tripod face ttt is nonempty, and in fact has dimension at least Ik 2 — 3k. On the 
other hand, for i > 3 it is not true that every tripod face has codimension one. See 
Example EH 

5.5. Examples. 

Example 5.14 (Q2 in rank 2). For k = 2 the dimension of Q2/H 1 is 3. The unit 
ball is an octahedron, whose extremal vertices correspond to the six distinct cyclic 
orders on the set of generators and their inverses; see Figure [2] These correspond 
(in antipodal pairs, one for each orientation) to three realizations of F% as 7Ti(S): 
one as the once-punctured torus with boundary representing the commutator [x, y], 
and two as thrice-punctured spheres, in the first case with boundary representing 
the conjugacy classes x, y, XY and in the second case with boundary representing 
the conjugacy classes x, Y, Xy. 




Example 5.15 (Q2 in rank 3). For k = 3 the dimension of Q2/H 1 is 10. The 
unit ball has 120 vertices and 40 faces, which are precisely the set of tripod faces. 
Tripod faces come in antipodal pairs; each face contains exactly 60 vertices, so an 
antipodal pair contain all the vertices. The vertices are exactly the set of rotation 
quasimorphisms associated to basic realizations. 
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Example 5.16 (Q2 in rank 4). For k = 4 the dimension of Q2/H 1 is 21. There are 
112 tripod faces, which come in 56 antipodal pairs. There are 267120 vertices, of 
which 5040 are the rotation vertices. 

What about the 262080 non-rotation vertices in rank 4? Notice that every rota- 
tion vertex takes the value 1/2 or —1/2 on every tripod. It turns out experimentally 
that every vertex of the unit ball in Q2/H 1 in rank 4 takes values in {—1/2, 0, 1/2} 
on every tripod, and in fact this holds for all ranks. 

Proposition 5.17. For any rank, every vertex of the unit ball in Q2/H 1 takes 
values in {—1/2,0, 1/2} on every tripod. 

Proof. This follows from general facts about 2-cocycles on contractible simplicial 
complexes. Recall Lemma [5.101 equating Q2/H 1 and Z 2 of the simplex S whose 
vertices are the symmetric generating set of the free group. Let us be given a vertex 
of the unit ball (f> £ Q2/H 1 . By abuse of notation, we'll also denote the function on 
1-simplices by <f>. Because <f> is a vertex, it is determined by the collection of tripods 
on which it takes 1/2 its defect (which must be 1). Equivalently, 8(f) is determined 
by the 2-simplices on which it is 1/2. We will try to conclude the value of <f» from 
the known values of 5(f). Consider any connected collection K of simplices on which 
the value of 8(f) is fixed (at 1/2). We can set the value of <f> on each 1-simplex in 
K to be or a sum or difference of values of 8<f>, as appropriate to get the correct 
values of 8<j) on K. The fact that 8(f) is a cocycle ensures that this is possible. Now, 
these values for <f> may not be extendable over the entire simplex S, but they extend 
up to a 1-cocycle on K. 

Now take any 2-simplex s on which the value of 8(f) is not ±1/2; to prove the 
proposition, we must show that 8(f>(s) = 0. Note that |5<^(s)| < 1/2, because by 
assumption (f>, being a vertex, takes its defect on a maximal collection of tripods. 
Take a collection K of 2-simplices on which S(f> is ±1/2, such that this collection 
includes the 1-simplices in s. This is possible because the values of <f> must be 
determined by 8<f>. As above, we can assign values to <f> compatibly, which will be 
correct up to a 1-cocycle on K, and these values will be sums and differences of the 
values of 8(f>; that is, they will be in hZ. We can compute 8(f>(s) using these values 
of (f>, so S(f>(s) e ^Z, so we have S(f>(s) — 0, as desired. □ 

A similar phenomenon occurs in rank 2 for £ = 3, and we therefore make the 
following conjecture: 

Conjecture 5.18. Every vertex ofQi/H 1 takes values in { — 1/2,0, 1/2} on every 
tripod of length £ — 1 . 

This does not follow from the proof of the £ = 2 case because for higher lengths, 
the tripods interact in more complicated ways; however, the proof should be similar. 

Example 5.19 (Q3 in rank 2). For rank 2, the unit ball in Q3/H 1 is an 11 dimen- 
sional polyhedron with 168 faces, and 4356 vertices of which 6 correspond to the 
vertices of the unit ball in Q2IH 1 (i.e. to the rotation quasimorphisms associated 
to basic realizations). There are 216 tripods with edges of length 2, so 48 of the 
tripod faces are not of maximal dimension. 

5.6. Generalizations. One natural example to consider is the generalization from 
groups to groupoids. Groupoid generators for free groups arise naturally e.g. by 
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passing to finite index subgroups, since a nontrivial covering space of a graph will 
necessarily have more than one vertex. 

Example 5.20 (Groupoid generators). In the sequel, especially in § [71 we will be 
interested in more complicated graphs X with m(X) — F. Such a graph determines 
in an obvious way groupoid generators for F, and it makes sense to define counting 
quasimorphisms with respect to such groupoid generators. We may define fatgraphs 
over X, and surface realizations compatible with a given graph structure (see § [7] 
for precise definitions) , and the analog of Theorem 15.111 holds in this context with 
essentially the same proof, namely that the codimension one faces of the unit ball 
in Q2/H 1 are exactly the tripod faces "K x ,y,z associated to a cyclic ordering on a 
triple of edges incident to some vertex of X. Moreover, if denotes the valence of 
a vertex v of X, the product (B v S\v\ of symmetric groups acts by permutation on 
the tripod faces, and extends to a linear action on Q2/H 1 . Of course, for each v 
the action of S\ v \ factors through the induced action on the set of cyclic orders on 
triples; so when \v\ — 3 each factors through Z/2Z. 

6. STALLINGS TRAINTRACKS AND SURFACE SUBGROUPS OF DOUBLES 

The virtue of working with traintracks is to replace some (typically infinite di- 
mensional) subcone of B + with a finite dimensional approximation W + . By judi- 
cious choice of traintracks, this lets us reduce certain a priori infinite optimization 
problems to finite linear programming. 

In this section we discuss optimization problems associated to a pair (F, {i^}) 
where F and each Fi is a finitely generated free group, and each f, C f . The 
main application is the Double Norm Theorem, which says that if G is obtained 
by doubling a free group F along a collection of finitely generated subgroups, then 
the unit ball in the Gromov norm on i?2(G;R) is a rational polyhedron, and every 
rational class is projectively represented by a sum of closed surface subgroups. 

6.1. Stallings traintracks. Let F be a finitely generated free group, and F 1 a 
finitely generated subgroup of F. If we fix a free generating set for F, then the 
generators of F' can be expressed as reduced words in F. 

Starting with a (pointed) bouquet of circles whose (oriented) edges are labeled by 
reduced generators for F' , Stallings [ 3"6"] defined a folding procedure which produces 
a new (finite) pointed graph T, with (oriented) edges labeled by reduced words in 
F, in such a way that non-backtracking walks in T starting and ending at the 
basepoint are in bijection with elements of F' , where the element is obtained from 
the walk by reading the edge labels (taking inverse labels when the orientation of 
the walk disagrees with the orientation of the edges) . It is useful to subdivide edges 
if necessary so that each edge is labeled by a single generator of F, and we adhere 
to this convention. 

It is a straightforward and well-known observation that for such a Stallings folded 
graph r, there is a surjective map from the set of non-backtracking loops in T to 
the set of conjugacy classes in F that are conjugate into F', where the surjection is 
obtained by reading edge labels as above. This map is not necessarily a bijection, 
since there might be (homotopically) distinct loops in T with the same labels in 
F. This reflects the fact that there might be distinct conjugacy classes in F' which 
become conjugate in F. 

There are many canonical ways to replace the graph T by a (directed) traintrack. 
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Definition 6.1. Let F' be a finitely generated subgroup of F, and let T be a 
Stallings folded graph for F' . The Stallings traintrack Tr is defined as follows. 

The vertices of Tr are in bijection with the non-backtracking paths in T of length 
2, and the (directed) edges of Tr are in bijection with the non-backtracking paths 
in r of length 3. The endpoints of an edge corresponding to a path of length 3 are 
the initial and terminal subpaths of length 2. 

The label on an edge is the label in T associated to the middle segment of a path 
of length 3. 

Lemma 6.2. There is a natural bijection between non-backtracking loops in T and 
directed loops in Tr . 

Proof. Each non-backtracking loop in F determines a loop in Tr by (cyclically) 
concatenating the successive subpaths of length 3. Conversely, each loop in Tp 
determines a loop in T by taking each edge of Tp to the middle segment of the 
associated path in T. Since the condition that a path be non-backtracking is certified 
by the set of subpaths of length 2, the image is non-backtracking. These maps are 
mutually inverse. □ 

Now let Fi , i<2, • • • , Fk be a finite collection of finitely generated subgroups of 
F. For each i we have a Stallings graph Ti, and we let r = JJ i T; denote the graph 
obtained by disjoint union. Form Tr from T as above. 

We let W + denote the cone of non- negative weights on Tr, and Y + the cone of 
non-ncg ative polygon vectors. Recall (see Definition [373]) that W + = W nR + [E] 
where W — kerd © H : R[E] -> R[V] ® Hi(F). But there is also a rational linear 
map H' : kerd — »• ®iHi(Fi), whose restriction to W has image equal to the kernel 
of the inclusion map @iH\(Fi) — > H\(F). 

The map H' is rational linear, and therefore we conclude the following: 

Proposition 6.3. For each a € ker(® iHi (Fi) — > Hi(F)) define sc1f(o;) to be the 
infimum of sc\p(w) over w conjugate into ®iC + (Fi) representing a in ®iH\(Fi). 
The function sc\f is piecewise rational linear on kei((BiHi(Fi) — > Hi(F)), and 
for any rational a, it is achieved on some rational w a , projectively bounding an 
extremal surface in F. 

Proof. The restriction H' : W + -> kcr((S l Hi(F i ) -> H±(F)) is rational linear. Now 
apply Theorem |3~2"31 □ 

6.2. Doubles of free groups. 

Definition 6.4. Let F be a a group, and Ft, F2, ■ ■ ■ , Tfc a finite collection of finitely 
generated subgroups of F . The double of F along the Fi is the fundamental group 
of the space Z obtained as follows. Take K a K(F, 1), and Ki a K(Fi, 1) for each 
i, and let X be obtained by attaching the mapping cone of the map Ki — > K 
associated to the inclusion Fi — > F. We let Y be the disjoint union of the Ki in 
X, and let Z be obtained from two copies of X by identifying the copies of Y with 
each other by the identity. 

We denote the double by G := D(F, {Fi}). The double is a graph of groups with 
associated graph consisting of two vertices joined by k distinct edges. Note that Z 
as constructed above is a K(G, 1), and therefore we may compute the homology of 
G by Mayer- Vietoris. In particular, we have an exact sequence 

^ H 2 (Z) ^ Ht(Y) ^ Ht(X) 
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or in other words, 

H 2 (G) = ker(© l iJ 1 ( J F l ) -> H^F)) 

6.3. The Double Norm Theorem. From Proposition 16.31 we deduce the Double 
Norm Theorem: 

Theorem 6.5 (Double Norm Theorem). Let F be a free group, and F^ a finite 
collection of finitely generated subgroups. Let G be the double of F along the Fi; 
i.e. G := D(F, {Fi}). Then the unit ball in the Gromov norm on H 2 (G) is a 
rational polyhedron, and every rational class in H2(G) is projectively represented 
by a finite sum of infective closed surface subgroups. 

Proof. Let A e H 2 (G) correspond to a € ker ® i H 1 (F i ) -> H 1 (F). Then by Propo- 
sition self (a) is rational, and projectively realized by some surface S with 
boundary in Y. Doubling this surface gives a closed surface in Z projectively rep- 
resenting A. By construction, this surface realizes the Gromov norm of A; in fact, 
Halloo = 4 self (a). It follows that the components are injective. The rational poly- 
hedrality of the norm also follows from ||j4.||co = 4 self (a) and Proposition 16.31 □ 

If we consider a more complicated amalgam like G = F\ *f 3 F 2 , then A € H 2 (G) 
corresponds to classes a S Hi(F 3 ) in the kernel of inclusions into both Hi(F 1 ) and 
Hi(F 2 ). We get an inequality ||^4||oo > 2(sc1f 1 (a) +sclf 2 (a)), and we can even find 
extremal surfaces S\ and S 2 ; however there is now no guarantee that the boundaries 
will glue up. 

A more refined estimate gives ||^4||oo > mi w 2{sc\F 1 {w) + sc\f 2 {w)) where the 
infimum is now taken over the rational polyhedron of weights w S W + representing 
a in Hii^F^) (in particular, the right hand side of this inequality is piecewise rational 
linear on H 2 (G)). If the support of some infimal w is an embedded 1-manifold in T, 
extremal surfaces on either side can be glued up (after passing to a cover), giving 
rise to an injective surface in G. We are therefore motivated to make the following 
conjecture: 

Conjecture 6.6 (Norm Conjecture). Let G be a graph of groups whose vertex and 
edge groups are free. Then the unit ball in the Gromov norm on H 2 (G) is a rational 
polyhedron, and every rational class is projectively represented by a finite sum of 
injective closed surface subgroups. 

7. ENDOMORPHISMS AND TRAINTRACK DYNAMICS 

Let G be a group, and let (j> : G — > G be an endomorphism. Pre and post compo- 
sition with an inner automorphism of G defines an equivalence relation on the set 
of endomorphisms, and we call the equivalence classes Outer Endomorphisms. If 
x G G determines the inner automorphism I x , given by the formula I x {y) = xyx~ l 
for all y £ G, then <f>I x = I^uAcj), and therefore composition of Outer Endomor- 
phisms is well-defined and associative. 

Let F be a free group. By abuse of notation, we denote the semigroup of Outer 
Endomorphisms of F by End(i r ), and for the sake of brevity we frequently drop 
the adjective "Outer" in the sequel. 
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7.1. Linearizing endomorphisms. We are concerned in the sequel with the ac- 
tion of End(F) on the linear spaces and objects defined in the first part of this 
paper. There is an obvious linear action of End(F) on the spaces C(F) and B(F), 
but without more hypotheses there is no natural way to define 4> : Wg — > Wg so 
that 4> o $^ (c) =$jo (/>(c) for all c £ C . However, <fi is well-defined as a linear map 
between suitable different weight spaces: 

Proposition 7.1. Suppose is injective. Then for any £ there is an £' so that 
there is a well-defined map <fi : We —> Wg in the sense that <fi ° ^i'{c) = &e o </>(c) 
for any c £ C . 

Proof. Since every finitely generated subgroup of a free group is quasiconvex, the 
map <t> : F — >• F is a quasi-isometric embedding. By the definition of quasi-isometry, 
and elementary properties of free groups, there is a constant T so that if u and v 
are reduced, then <p(uv) is obtained from </>(it) and 4>{v) by canceling at most T 
letters from the suffix of <fr(u) and the prefix of 4>(v) (or else we could find points 
arbitrarily far apart in F whose image in <j>(F) is a bounded distance apart). 

It follows that for any word v with length(v) > 2T there is a well defined core 
4> c (v) which is a maximal subword of 4>(v) that is never canceled in the image 
of (j)(uvw) for any reduced uvw. By the definition of T, we have an estimate 
length(0 c (v)) > length (0(i;)) — 2T. Moreover if v is a prefix of v', then <fi c (v) is a 
prefix of 4> c (v'). 

Since <f> is a quasi-isometric embedding, there is some £' with the property that 
length(</>(u)) > £ + 2T — 1 whenever length (u) > £' — 1. Let w be a word of length 

and let v~ be obtained from v by removing the last letter. Notice that <fi c (v~) 
is a prefix of ^ c (w) and satisfies length(</> c (u~ )) > £ — 1. By abuse of notation, for 
t; € Wf< we define 4>{v) £ Wt to be the sum of the (possibly empty) set of subwords 
of 4> c (v) of length £ whose last letter is in the suffix <f> c (v) — <fi c (v~). 

Now suppose w is a cyclic word, and let Wi be successive subwords of length £' . 
The cores <p c (w~) wrap monotonically around the cyclically reduced representative 
of <j)(w), and 4> c (wi) has 4> c (w~) as a prefix and 4'c(u ! ~ +1 ) as a suffix. Since each 
<fi c (w~) has length at least £—1, every subword of 4>(w) of length £ occurs in exactly 
one 4> c (wi) in such a way that its last letter is in <p c (wi) — 4> c (w~). In other words, 
</> o (w) = $£o 4>(w) as desired. □ 

It follows that cj> induces (jf : Qi — > Qi> , and thereby an endomorphism of Q CO unt 
to itself. 

Remark 7.2. Once we have seen that 4> '■ We — > Wi as in Proposition 17.11 is well- 
defined, it follows that it is constant on the Outer Endomorphism class of </>, since 
Wg is spanned by terms of the form <&g>{c) for c £ C, and conjugation acts trivially 
onC. 

Example 7.3. The hypothesis that <f> is injective is essential in Proposition 17. II For 
example, let F = (x,y,z) and define an endomorphism by 4>{x) = x, <fr(y) = y, 
4>(z) = id. Define two cyclic words 

wi. n := xz n yz n xz n yz n and W2, n ■= xz n xz n yz n yz n 

Then $e(w hn ) = <£> e (w 2 , n ) for all £ < n + 2, but $ 2 (<£(wi,n)) ^ $ 2 (^(w2,n))< 

Suppose there is some £ such that length(<fi c (v)) > length(u) for all v with 
length(u) > £ — 1. Then the proof of Proposition 17.11 shows that </> : — ^ 
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is well-defined. Automorphisms (with almost trivial exceptions) never satisfy this 
condition for any I. By contrast, this condition is generic (in a strong sense) for 
endomorphisms. 

Definition 7.4. Fix a free group F of finite rank, and fix a free generating set. 
A random endomorphism of F of length n is the endomorphism determined by 
sending each of the generators to a random reduced word in F of length n, where 
the words are chosen independently and with the uniform distribution. 

Lemma 7.5. Fix a free generating set S for F, and for a given n let <p : F — >• F 
take each generator to a random reduced word of length < n. Then there is a 
constant C > 1 so that with probability 1 — 0(C~ n ) we have length.(0 c (x)) > f for 
each generator x G S, and therefore <f> has a well-defined action on We for all £ > 2. 

Proof. Two random words of length n have a common maximal prefix of order 
0(log n), with probability 1 — 0(C~ n ). Since the cardinality of S is fixed, it follows 
that length(0 c (a;)) > n — O(logn) > 1 with probability 1 — 0(C~ n ), and therefore 
<fi acts on We for I > 2 by the argument of Proposition 17. 1 1 □ 

Theorem 7.6 (Random Isometry). Fix a free generating set for F and let <p be a 
random endomorphism of F of length n. There is a constant C > I so that with 
probability I — 0(C~ n ), the map </> : We —> We is an eventual isometry. That is, 
there is an m ( depending on £ but not on (j) or n) so that \\(j) m+N {w)\\e = \\cj> m {w)\\ t 
for all w &We and all N > 0. 

Proof. Suppose that x,y,z are the labels on a tripod of length £ — 1. Replace 
the edge labels with <fi(x),4>(y),4>(z); the resulting graph can be (Stallings) folded, 
giving rise to a new tripod with edges of length t(n — O(logn)). There are finitely 
many tripods of length £ — 1 , and therefore the action of <f> is eventually periodic, 
acting by permutations on some finite subset of the set of tripods of length I — f . 

This collection of tripods defines a collection of tripod hyperplanes, which cut 
out a polyhedron in Qe/H 1 ; this polyhedron is dual to a polyhedron in We which 
is the unit ball in a (pseudo)-norm invariant under <f>, and whose value on w is the 
eventually constant value of ||</> m (u>)||£. □ 

Remark 7.7. Theorem 17.61 implies that the spectrum of a random <f> on We (where 
it is defined) consists of zeros and roots of unity. This theorem should be compared 
with Theorem 3.16 from |13j which says that a random endomorphism of F is an 
isometry of scl on B/e. 

The following example shows that the action on W2 does not need to be injective. 

Example 7.8. Let F — (x,y), and define 4> : F —¥ F on generators by <j)(x) = xY 
and <f)(y) — yx. We compute 4> c (x) = x, <j) c (X) — X, <f) c (y) — x, and <j) c (Y) = X. 
In this example, for any v, the core 4> c (v) is obtained from the reduced word 4>(v) 
by removing an initial y or terminal Y if there is one, so for example, 

4>{yyXYx) — yxyxyXXYxY and <j) c (yyXYx) — xyxyXXYx 

The action of on W2 is given in coordinates by the following table: 
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XX 




YY 


XY + YX 



There are 8 length 1 tripods in F. We denote a tripod by its outgoing labels, 
in cyclic order. The action of cf> takes X,Y,y — > X, Y, x and y,Y,x — > y, X, x. 
It interchanges X, Y, x and y, X, x; there is a similar action on the four tripods 
obtained by reversing the cyclic orders on these triples. 

7.2. Graph maps. The verbal traintracks relate naturally to the structure of F as 
a group with a fixed generating set. When studying endomorphisms of F geometri- 
cally, it is natural to think of F as the fundamental group of a graph, possibly with 
more than one basepoint. Edges of such a graph become groupoid "generators" for 
F, and it is convenient to generalize the theory of verbal traintracks to this setting. 
This generalization is routine, and leads to no unpleasant surprises. 

Definition 7.9. Let F be a free group. A graph realization consists of a connected 
graph X without 1-valent vertices, together with a conjugacy class of isomorphism 

F -> 7Tl (X). 

An outer endomorphism <j> ; F — >• F induces a well-defined homotopy class of 
self-map X — ¥ X , and conversely. The following terminology is standard: 

Definition 7.10. Let A be a graph without 1-valent vertices. A graph map is a 
map / : X — > X taking vertices to vertices, and edges to (possibly empty) simplicial 
paths without backtracking. A traintrack map is a graph map / : X — > X such 
that /" is a graph map for every positive n. A graph immersion is a graph map 
/ : X — > X which restricts to an embedding on the link of each vertex. 

Notice that a graph immersion is a traintrack map. 

If / : X — > X is a graph map, and 7 is an immersed path in X, then we 
can define fi(j) to be the immersed path (i.e. the geodesic) in X in the relative 
homotopy class of 7(7). Note that it makes sense to define the core f c {"f) (in the 
sense of Proposition [73} to be the maximal subpath of /i(7) which is contained in 
fi{l') whenever 7 is extended to an immersed path 7'. If / : X — > X is a graph 
immersion, then f c {lf) = fi{l) = f{l) for every immersed path 7. 

Definition 7.11. If A is a simplicial graph, we define Ef to be the set of non- 
backtracking directed simplicial paths in X of length I. We define V e x to be the 
set of non-backtracking paths of length I — 1 . In a manner completely analogous 
to Definition Ol we define d : M.[E?] -4 R[V e x ] and h : R[E? ] -4 Hx(F), although 
here the function h is slightly more complicated. In order to assign a homology class 
to a path, we pick a spanning tree in A; the remaining edges give a set of generators 
of 7Ti(A), which are each in turn associated with a homology class in Hi(F). To 
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a path that starts on one of these edges, we assign the associated homology class. 
The other paths are sent to zero. 

Finally, we set = ker[<9 © h]. This is the weight space on the set of verbal 
paths in X. 

Example 7.12. Let X be the standard rose (i.e. a wedge of circles), with a generator 
of F associated with each circle in the identification F = n±(X). Then Wjf is We, 
the previously defined weight space on words of length t. 

Recall the cone B + . The maps $^ : B + — > We are defined by counting cyclic 
subwords. For a graph realization X of F, we get similar maps §f : B + — »■ We 
where $>f (C) is the collection of subpaths of the non-backtracking representative 
of C in A. 

Example 7.13. For X the standard rose, <&f = 5^. 

Lemma 7.14. Forb € B + and for X a graph realization of F there is an inequality 
scl($>f (b)) < scl(6). Furthermore, if every primitive word in the support of b has 
length at most £ — 1, then scl(&f(b)) = scl(6). 

Proof. This is completely analogous to the discussion in Section l4~3l □ 

7.3. Irreducible endomorphisms. 

Definition 7.15. An endomorphism <j> of F is irreducible if and only if there is no 
proper free factor F' of F for which cf> n (F') is conjugate into F' for some positive 
n. 

Note that irreducible endomorphisms are injective. 

Reducibility (i.e. failure to be irreducible) can be detected at the level of a graph 
map / : X — > X representing <fr, namely that there should exist such a graph map 
with no invariant forest whose transition matrix N fails to be irreducible, in the 
sense that there are i,j such that (N n )ij = for all n (see [4], Lem. 1.2). 

Irreducibility is a very natural condition to impose on an endomorphism. Ir- 
reducible automorphisms can be very complicated; on the other hand, Reynolds 
showed that an endomorphism which is irreducible but not surjective is represented 
by a graph immersion: 

Theorem 7.16 (Reynolds [31] , Corollary 5.5). Let <f> : F — >• F be an irreducible 
endomorphism which is not an automorphism. Then <p is represented by a graph 
immersion / 

Corollary 7.17. Let <j) : F — > F be an irreducible endomorphism which is not an 
automorphism. Then there is a graph realization of (f> as f : X —> X so that <j) acts 
naturally on W^ for all I. 

Proof. This is just the observation that a graph immersion (whose existence is 
guaranteed by Theorem 17. 16p satisfies f c (j) — /(t) f° r an immersed paths 7, and 
therefore has length(/ c (7)) > length^) for all 7. Now apply the argument of 
Proposition 17. II □ 

7.4. Surface realizations. 

Definition 7.18. Let F be a free group. A surface realization consists of a compact 
connected oriented surface S, together with a conjugacy class of isomorphism F — > 

7Tl(£). 
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By abuse of notation, we will often let E stand for the realization, although the 
data of the conjugacy class of isomorphism (i.e. the "marking" in the language of 
Teichmuller theory) is an essential part of the data. 

Example 7.19. The basic realizations obtained from a fatgraph structure on a rose 
X defined in § 15.31 are examples of surface realizations. 

A choice of basis for F determines a graph realization whose underlying graph is 
a rose. Just as basic realizations are obtained from a choice of fatgraph structure 
on a rose, surface realizations may be obtained from graph realizations by a choice 
of fattening. Recall the definition (Definition I3.17[) of a fatgraph. 

A fatgraph structure on a graph A determines a canonical "fattening" of X to 
a surface E(X) which contains X as a deformation retract. 

Definition 7.20. If X is a graph realization of F, then a fatgraph structure on 
X determines a surface realization E(X) of F. We say the surface realization is 
obtained from the graph realization by fattening. 

Let E be a surface realization of F. An outer endomorphism (f> : F —¥ F deter- 
mines a homotopy class of self-map E — > E, and conversely. If AT is a rose, edges 
of X correspond to a generating set (i.e. a "basis") for F, and fattenings of X 
correspond to a choice of cyclic order on the symmetrized generating set. 

Definition 7.21. An endomorphism <fi : F — > F is geometric with respect to a 
surface realization E if there is a hyperbolic structure on E with geodesic boundary 
such that the induced homotopy class E — > E is represented by an orientation- 
preserving immersion taking <9E to a collection of geodesies. 

It is convenient to use the language of pleated surfaces; see e.g. [38], § 8.8- 
10 for an introduction. If we fix a hyperbolic structure on E and a (suitable) 
ideal triangulation of E, there is a pleated representative E — > E which takes each 
ideal triangle A in the triangulation to an ideal triangle. An endomorphism is 
geometric (in the sense of Definition I7.21j) if any (equivalently all) of these pleated 
representatives is an immersion — i.e. if it is orientation-preserving on each ideal 
triangle. 

Example 7.22. Let E be a surface realization of F. An automorphism 4* £ Out(F) 
is geometric with respect to E if and only if it is an element of the mapping class 
group Mod(E). 

Fix a finite graph X. Let Fat (A) denote the set of fattenings of A. This is a finite 
set. Fattenings pull back under immersions; hence a graph immersion / : X — > A 
induces an endomorphism /* of Fat(A). 

Lemma 7.23. Let E(A) be a fattening of X invariant under f : X — » X. Then 
the induced homotopy class of map f : E(A) — > E(A) is represented by a pleated 
immersion. 

Proof. We abbreviate E(A) by E in the sequel. 

A fattening determines a planar embedding of the universal cover X of X , and 
thereby a circular order on the set of ends of X. Pick a hyperbolic structure on 
E and an ideal triangulation. Each ideal triangle determines three ends of A, 
and its orientation is determined by the cyclic order on this triple of ends. Since 
the fattening is invariant by hypothesis, a pleated representative takes each ideal 
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triangle to an ideal triangle with the same orientation. Such a map is necessarily 
an immersion. □ 

Theorem 7.24. Let <fi : F — > F be an irreducible endomorphism which is not an 
automorphism. There is a finite power which is geometric with respect to some 
surface realization of F. 

Proof. By Theorem 17.161 there is a graph realization X so that the induced map 
/ : X —> X is a graph immersion. Since Fat(A) is finite, the map /* is pre-periodic, 
and therefore some power f n fixes a fattening. This fattening determines a surface 
realization with respect to which </>™ is geometric, by Lemma 17.231 □ 

7.5. Skinning maps. A pleated immersion / : E — >• E pulls back a hyperbolic 
structure, and thereby induces a skinning map f* : Teich(E) — > Teich(E). If / is 
irreducible this map has no fixed point, since (for any fixed hyperbolic structure on 
E) the length of the geodesic representative of f n (j) is proper for each nontrivial 
homotopy class of loop 7. 

Nevertheless, /* will fix some projective length function £ on conjugacy classes 
in F, corresponding to some degenerating sequence of hyperbolic structures on E. 
Since the length of the boundary geodesies increases without bound, most of the 
boundary is contained in the thin part. The thick-thin decomposition gives E the 
structure of a graph X in an /-invariant way, where edges correspond to long thin 
rectangles with two opposite edges on 9E. By irreducibility, each of the vertices of 
the graph corresponds to a simply-connected component of E. Thus the graph is 
homotopy equivalent to E, and we recover the original graph immersion / : X — >• X 
in the limit. 

7.6. Finite covers. Let / : X — > X be a graph immersion. Then there is a power 
/" which fixes the stars of finitely many vertices pointwise, and takes the star of 
every other vertex into one of the fixed stars. An invariant fattening for /" is the 
same thing as a choice of cyclic ordering on the edges at each fixed star. 

If X' — »• X is a finite cover, then / lifts to /' : X' — > X' if and only if /* (m {X')) is 
conjugate into wi(X'). This is guaranteed, for example, if iri(X') is a characteristic 
subgroup of tti(X). If G is a finitely generated group, a natural class of finite 
index characteristic subgroups is given by fixing an n and defining G n to be the 
intersection of the kernels of all homomorphisms from G to S n , the symmetric group 
of order n. If G is residually finite, f] G n = id. The lift of a graph immersion 
will be a graph immersion. In this way we can construct many fattenings of finite 
covers of X invariant under suitable powers of lifts of /. 

Remark 7.25. Even if / : X — > X is irreducible, it is not typically true that a lift 
/' : X' — > X' will be irreducible. For example, LERF for free groups implies that 
/*(7Ti(X)) is a free factor of iri(X') for some finite cover X' of X; the induced map 
on X' is evidently reducible, taking /*(7Ti(X)) into itself. 

8. Folded surfaces and /-folded surfaces 

Given a free group and an injective self-endomorphism, we can form the asso- 
ciated HNN extension. Such extensions are "generically" hyperbolic and 1-ended, 
and we can ask which ones contain surface subgroups. In this section we introduce 
a class of surfaces in such extensions — the so-called /-folded surfaces — satisfying 
certain combinatorial conditions that ensure they are injective. 
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The first main result in this section is the Folded Feasibility Theorem 18. 71 proved 
in § 18. 2\ which says that there is a fast, practical algorithm to search for /-folded 
surfaces. The algorithm is essentially a linear programming problem in a polygon 
space, very much like the algorithm implicit in the proof of the Traintrack Ratio- 
nality Theorem 13.241 except that we are only interested in the feasibility of the 
problem, rather than finding a solution minimizing some linear functional. 

The second main result is the Random /-folded Surface Theorem, proved in 
§ 18.31 which shows that for a random endomorphism of a free group there exists an 
/-folded surface with probability 1 (in a suitable sense). 

The third main result is the construction of an explicit example of an injective 
surface in Sapir's group, i.e. the extension of F% := (a, b) associated to the en- 
domorphism <j) : a — > ab, b —> ba. This example was found by implementing the 
algorithm described in the Feasibility Theorem, and illustrates its practical utility. 

We also include the results of some substantial computer experiments, which 
(together with the Random /-folded Surface Theorem) constitute some evidence to 
support the conjecture that every one-ended hyperbolic expanding HNN extension 
of a free group contains many surface subgroups. 

8.1. HNN extensions. 

Definition 8.1. Let G be a group and 4> : G — > G a homomorphism. The HNN 
extension (denoted G*^) is the group given by the presentation 

(G,t\ tgt^ 1 = cj)(g) for aU 9 e G» 

There is a canonical (from its structure as an HNN extension) surjection of G^ 
onto Z, sending G to and t to 1. The kernel is normally generated by G. 

Geometrically, <fr induces a homotopy class of self-map from a K(G, 1) to itself 
(by abuse of notation we denote this map also by 0), and K{G c j > , 1) may be obtained 
as the mapping torus: 

K(Gf, 1) := K(G, 1) x [0, l]/(x, 1) ~ {<f>(x), 0) 

In the remainder of this section we will be interested in the case that <fi is an 
injective homomorphism from a free group F to itself which is irreducible but not 
surjective. 

8.2. /-folded surfaces. It turns out that the linear programming techniques from 
§ [3] can be adapted to search for surfaces which are injective without necessarily 
being extremal. Since we are focused on the computer implementation of this 
method, we spell out the combinatorial details. 

Let's assume that we have chosen a graph realization X for which </> is represented 
by a graph immersion / : X — > X (recall that the existence of X is guaranteed if 4> 
is irreducible but not surjective by Theorem 17. 16[) . 

Definition 8.2. A fatgraph Y over X is a graph together with a map tt : Y — > X 
sending edges to edges, together with a fattening Y — > T>{Y) so that the induced 
simplicial map 7r : <9£(Y") — > X is totally geodesic (i.e. it is a simplicial immersion 
of a 1-manifold into X). 

A fatgraph Y is folded if the map to X is an immersion. 

If X is the standard rose for F, a fatgraph over X is the usual kind of fatgraph 
studied in [8], and Y is folded if it is Stallings folded in the usual sense. Stallings' 
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argument implies in general that if Y is folded (for an arbitrary graph realization 
X) the induced map ni(Y) — > tti(X) = F is injective. Observe that the property 
of being folded ignores the fat structure on Y, and is inherited by finite covers. 
Informally, we think of a fatgraph Y over X as a fatgraph whose edges have been 
labeled by edges of X. 

If 7r : Y — > X is a fatgraph over X, then f(ir) : /(Y) — > X is the fatgraph over X 
with the same underlying topological space as Y, so that the map f(ir) is defined 
by the tautological formula f(ir) = f o n, and f(Y) is subdivided so that this map 
takes edges to edges. Informally /(Y) is obtained from Y by replacing edge labels 
by their images under /. 

Definition 8.3. An f -fatgraph Y over X is a fatgraph Y over X together with 
a decomposition of the boundary of the fattening <9£(Y) into submanifolds d~ 
and d + (each a union of components) so that there is an orientation-reversing 
honieomorphism / : d~ — > d + lifting / (i.e. for which irf = fir). 

Given a geodesic 1-manifold S immersed in X, we can ask for an /-fatgraph Y 
such that there is an oriented covering d~ — > S compatible with the immersions to 
X. Informally we say that Y "virtually bounds" S + f(S~ 1 ). 

An /-fatgraph Y (over X) is a fatgraph whose boundary components are paired 
by /• We can use this pairing to glue Y to f(Y) by identifying the image of 
<9 + (E(Y)) to the image of d~ (E(/(Y))). Informally, we denote the result by Y\ := 
Y U /(Y), and similarly define Y n :=FU f(Y) U • • • U f n (Y). 

Similarly we can use / to glue E(Y) to E(/(Y)) along certain boundary compo- 
nents, and denote the resulting surface by Ei(Y) := £(Y) U £(/(Y)), and similarly 
define £ n (Y) := £(Y) U £(/(Y)) U ••• U £(/"(Y)). 

Notice that there are two surfaces associated to Y„, namely £„(Y) (defined as 
above), and the thickening E(Y n ). The next lemma gives a simple condition which 
guarantees that these surfaces are "the same" : 

Lemma 8.4. Suppose that no vertex ofY is associated to more than one vertex in 
d~ . Then X„(Y) is homeomorphic to S(Y„). 

Proof. Each S(/ 4 (Y)) deformation retracts to f l {Y) 1 and the tracks of this defor- 
mation are proper essential arcs which deformation retract to points in the edges of 
Y, and proper essential trees which deformation retract to the vertices of Y. Glue 
up the tracks of the deformation of each S(/*(Y)) to the tracks in each £(/ l+1 (Y)) 
by the identification on the boundary. The result is a decomposition of E„(Y) into 
graphs, such that the space obtained by quotienting each graph to a point is Y n . 
We need to check that each such graph is a tree, so that the quotient map is a 
homotopy equivalence, and therefore that £„(Y) is homeomorphic to E(Y n ). But 
the hypothesis that no vertex of Y is associated to more than one vertex in d~ 
implies that every track r in each E(/ l (Y)) is attached by at most one point (i.e. 
a point of f l (d~)) to E(/ t_1 (Y)), and consequently each such track r deformation 
retracts in a canonical way to this point. By induction, each component of the 
decomposition of E„(Y) deformation retracts to a single track in some E(/ l (Y)) 
(the least i for which the intersection of the component with E(/*(Y)) is nonempty) 
and the conclusion follows. □ 

Now for any /-fatgraph Y we distinguish, amongst the vertices of d + , those 
which are in the image of vertices of d~ under /, and call these f -vertices. 
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Definition 8.5. An /-fatgraph Y over X is f -folded if it satisfies the following 
conditions: 

(1) as a graph over X (forgetting the fat structure), the graph Y is Stallings 
folded; 

(2) every /-vertex in d + is associated to a 2-valent vertex of Y; 

(3) no vertex of Y is associated to more than one /-vertex in d + ; and 

(4) no vertex of Y is associated to more than one vertex in d~ . 

Theorem 8.6. Suppose Y is f -folded. Then the suspension ofH(Y) is injective in 

Proof. By Lemma \8. 41 and condition (4), it suffices to show that Y n — > X is folded 
for all n. But this follows by induction, since the hypotheses imply that each vertex 
of Y n of valence > 2 whose restriction to Y n -i has valence 2 is locally isomorphic 
to some vertex in Y. Since foldedness is local, the conclusion follows. □ 

The main theorem in this subsection is the existence of a fast, practical algorithm 
to find /-folded surfaces. The proof is straightforward, being a variation on the 
algorithms described in § [3] The main point is just that the property of being 
/-folded is local, and all /-folded surfaces with prescribed boundary may be built 
from finitely many pieces. 

Theorem 8.7 (Folded Feasibility). Given 4> : F —> F and a graph realization X 
such that 4> is represented by f : X —} X , and given S an immersed 1-manifold 
in X, there is a polynomial time algorithm to check whether there is an f -folded 
fatgraph Y over X which virtually bounds 6 + /(<5 _1 ). 

Proof. First we show that there is a polynomial time to check whether there is a 
folded fatgraph Y over X which virtually bounds 5 + f(5^ 1 ). 

We define finite sets of edges E, arcs A and polygons P, and linear maps Da : 
R[P] -> A[A] and D E : R[P] -> R[E] as in §|H By definition there is a simplicial 
immersion h : S U /(<S _1 ) — > X taking oriented edges to oriented edges, which plays 
the role of the evaluation function. There is an involution t on oriented edges of X 
which acts by reversing the orientation. An edge is an oriented edge of S + / (<5 _1 ). 
If e, e' are oriented edges of 6 + f(S~ 1 ) which map by h to the same edge of X but 
with opposite orientation, we say that the pairs of vertices of e, e' mapping to the 
same vertex of X are an arc, and all (unoriented) arcs arise in this way. Thus a 
rectangle R (in the sense of Definition ^. 12p is associated to each such pair e, e', and 
Da{FL) = a + a' (where a, a' are the associated oriented arcs), and De{R) = e + e'. 

A polygon is either a rectangle as above, or a cyclic list of arcs, satisfying the 
following properties. Firstly, we require that each polygon should have at least three 
arcs. Each arc is (for some orientation) the boundary of a rectangle. Secondly, we 
require that for each arc a that the arc obtained by reversing the orientation is in 
the boundary of some rectangle. The rectangle may not be unique, but it will map 
by h to a unique edge of X. Finally we require that the the edges associated to a 
polygon by this method are distinct from one another. 

Let D := D A © D E :R[P] ->• A[A] © R[E], and let y be a non- negative integral 
vector in the kernel of D. Then we can build a surface from rectangles and polygons 
by gluing them along arcs. The result is evidently a fatgraph Y over X, and the 
conditions on polygons imply that the map to X is an immersion. Hence the 
underlying graph is folded. Conversely, each such folded fatgraph over X with 
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boundary in S + f(6 1 ) gives rise to a non- negative integral vector in the kernel of 
D. 

To ensure that the resulting fatgraph is /-folded, we first to restrict the class of 
rectangles, so that each rectangle has at most one edge on 8. Secondly, we need to 
restrict the class of (non-rectangle) polygons, so that of the rectangles glued to a 
given polygon, at most two of them have an edge on S, and these edges are adjacent. 
Thirdly, we need to insist that no (non-rectangle) polygon can have an arc which 
ends on an /-vertex, and no arc can both start and end at an /-vertex. 

These conditions just restrict to explicit subsets A' C A and P' C P, and 
any vector y as above gives rise to an /-folded fatgraph. Conversely, any /-folded 
fatgraph determines a vector with polygons in P' and arcs in A' , so we are done. □ 

8.3. Random HNN extensions contain /-folded surfaces. Recall (Defini- 
tion 17.4)) that a random endomorphism of F of length n is obtained by sending the 
generators (in a fixed generating set) to random reduced words of length n. We 
refine this definition slightly as follows: 

Definition 8.8. A random reduced endomorphism of F is a random endomorphism 
4> conditioned on the fact that is represented by a graph immersion of the standard 
rose. 

We remark that there is a uniform e > such that a random endomorphism of 
F of length n is reduced with probability > e, independent of (sufficiently large) n. 

The main theorem of this subsection is that for any fixed k > 2, the HNN 
extension associated to a random endomorphism of Fk of length n contains an 
essential surface subgroup, with probability going to 1 as n goes to infinity. 

Theorem 8.9 (Random /-folded Surface). Let k > 2 be fixed, and let F be a free 
group of rank k. Let <f> be a random endomorphism of F of length n. Then the 
probability that F*^ contains an essential surface subgroup is at least 1 — 0(C~ n ) 
for some C > 1 and c > 0. 

The remainder of this subsection is devoted to the proof of Theorem 18.91 
If <j) is a random reduced endomorphism, the essential surface subgroup can ac- 
tually be taken to be /-folded in the sense of Definition 18.51 In the general case 
we must show how to generalize the definition of /-folded when / is not neces- 
sarily an immersion, and show that the surface we construct can be taken to be 
/-folded in this new sense. For the sake of clarity we postpone the discussion of this 
generalization, and first prove the theorem under the hypothesis that <j> is reduced. 

Notice that we do not specify the chain S for which S + /(<5 _1 ) virtually bounds 
an /-folded surface. The fact is that we can taken any (nontrivial) homologically 
trivial chain for S. To see why, let w be a homologically trivial cyclically reduced 
word, and let v = ^>(w _1 ). With overwhelming probability, we can find a copy 
of in 4>{a) for any generator a appearing in w, and use this to cancel w in 
some partial fatgraph. Similarly, if u is a subword of v of length 2 containing an 
/-vertex /-vertex, we can (again with overwhelming probability) find a copy of u^ 1 
somewhere in (j>(a), and glue these pairs of segments in our partial fatgraph. Since w 
is fixed but n — > oo, this shows that the /-folded condition more or less reduces just 
to the problem of finding a folded fatgraph with boundary a random homologically 
trivial reduced word of length ~ n. For the sake of simplicity however we will give 
the proof for the word w — abAB (note our convention of writing a -1 as A and so 
on). 
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8.3.1. Building Y by inductively pairing segments. A fatgraph Y bounding w + v 
will be obtained by decomposing w + v into disjoint subwords which will be glued 
(along opposite sides of an edge of Y) in mutually inverse pairs. We will describe 
how to build Y by inductively specifying pairs of subwords to glue. After some but 
not all of these subwords have been glued, we have a partial fatgraph, consisting 
of the union of glued segments, together with a collection of unglued cyclic words, 
which may be thought of as a chain in B\(F). In order to control the local structure 
of the resulting fatgraph (to ensure /-foldedness), we need to keep track of some 
extra data in the unglued cyclic words, namely the labels on edges of the (partially 
glued) fatgraph that have a vertex on an unglued edge. We will ensure as we 
glue that there is at most a single terminal edge of the partially glued fatgraph in 
incident to an unglued segment, and we call this terminal edge a tag. Colloquially, 
we refer to the unglued collection of words together with this data as a chain with 
tags. Thus, the problem of finding a /-folded fatgraph bounding a given chain can 
be reduced, by gluing up a pair of inverse subwords, to the problem of finding a /- 
folded fatgraph bounding a (simpler) chain with tags. Pairs of inverse segments in 
a chain of tags may themselves be identified, but in order to ensure the /-foldedness 
condition, we must be careful that the resulting vertices in the fatgraph satisfy the 
necessary conditions spelled out in Definition 18.51 The data of the labels on the 
tags is enough to determine this. 

We introduce the following notation. If v is a segment appearing somewhere in 
a cyclic word, we write this as -v-. We indicate a tag in the middle of a segment by 
■v\. z V2', where z is the letter on the tag oriented to point into the segment. Suppose 
we have segments -wipvi- and -v^Pw^- for inverse subwords p, P. Let z\ be the last 
letter of P, and Z2 the last letter of p. Then in this notation, after gluing p to P 
we create segments ■w\. Zl W2- and ■V2. Z2 v i'- 

8.3.2. First step: reducing f -foldedness to foldedness. Here and in what follows, we 
let x = 4>{a) and y = (f)(b), and use X = x^ 1 and Y = y • Since by hypothesis 
4> is represented by a graph immersion, the word 4>{abAB)~ l = yxYX is cyclically 
reduced. We write x = x L x'X R and y = yhV'yR where \x L \ = \x R \ = \y L \ = \vr\ = 
log(n) 2 , which makes sense provided n » log(n) 2 . 

There are 4 /-vertices in yxYX, one between the last letter of y R and the first 
letter of xl, and so on. Let a be the segment of yxYX of length 2 containing the 
last letter of y R and the first letter of Xl- With probability 1 — 0(C~ n ) we can 
find a copy of cr -1 in Xl, and we can pair these segments. We do something similar 
with each of the 2-letter segments containing an /-vertex. Finally, we look for a 
copy of -pbaBAP- in xl (say), disjoint from the eight subwords paired so far, for 
some letter p, and glue the p and P to create a loop labeled baBA. p , and then glue 
this to the loop abAB. The result of this gluing is to produce a new chain with a 
.p tag, which differs from yxYX only in the segments xl, xr, yhi yn- It is evident 
that finding an /-folded surface bounding abAB + yxYX reduces to finding a folded 
surface bounding this new chain with tags. 

8.3.3. Second step: folding off short loops. We now describe a procedure to modify 
the subwords y',x',Y',X' contained in y,x,Y,X as above (note that these en- 
tire subwords survive intact to the chain with tags at this stage). Except for the 
constraint that yxYX is cyclically reduced, it is almost true that y' and x' are 
independent random words of length n — 2 log(n) 2 in the uniform distribution. We 
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use the following trick to ignore the cyclically reduced constraint. If y, x were inde- 
pendent random reduced words of length n in the uniform distribution, there would 
still be a uniformly positive probability (> 5% chance, say) that yxYX is cyclically 
reduced. If we can prove that some property of y and x holds with probability 

1 — 0(C~ n ), then even after conditioning on the cyclically reduced constraint, the 
property will still hold with probability 1 — 0(C~ n ) (at the cost of adjusting the 
implicit constants involved). So we are justified in arguing as though y' and x' are 
independent random words in the uniform distribution, and are therefore produced 
by a stationary Markov process. 

We perform the following process. As we read off the letters of y' one by one, 
we look for a segment of length 11 of the form -vipuPv2-, where v\ ^ v^ 1 and p, P 
are letters, and \u\ = 7. Then the p and P can be glued, producing the segment 
■V\.pV2- and the loop u. p . We call u. p a short loop, and we call this the operation 
of folding off a short loop. A short loop is determined by a cyclically reduced word 
u, a choice of a "basepoint" for u, and a letter p not equal to the last letter of u or 
to the inverse of the first letter of u. This data (u,p) is called the type of a short 
loop. Let Li,k denote the number of types of short loop of length 7 in a free group 
of rank k, so for example L7.2 = 4376. 

After folding off a short loop, we skip ahead a fixed number of letters in y' (say 

2 letters), and then examine the segments of length 11 one by one until we get 
another chance to fold off a short loop. If we fold off every short loop that we can 
by this process, we see that different types of short loops will be produced with 
different probabilities, but since there are only a finite number of different types 
of short loops, the ratio of these probabilities are bounded. For a given short loop 
type £, let m be the relative probability that a short loop generated by this process 
is of type £, and let /io be the minimum of this collection of relative probabilities 
(note that the vector of /z* is only well-defined up to scale). We adjust the process 
of folding off short loops as follows: whenever we have the opportunity to fold off 
a short loop of type £, we do so with probability jj,o/ 1 \xt. 

Explicitly, we start with -y' ■ a random word generated by a stationary Markov 
process, and successively fold off short loops of type £ each time they appear with 
probability fxo/ f-e, or leave them intact with probability 1 — /io/ He., arid the result 
of this process is a new segment -y" ■ of length 0(n) together with a collection £ y 
of short loops, also of cardinality 0(n). 

Observe the following two facts. Firstly, the tagged word y" is itself generated by 
a stationary Markov process. Secondly, by the Chernoff inequality, with probability 
1 — 0(C~" C ), for every type £ the number of loops of type I is \L y |/X7 j fe±0(n 1 / 2+£ ). 
We perform similar folding-off processes for x',Y',X' to produce tagged words 
■x"-, -Y"-, -X" ■ and collections L y ,Lx,£>Y- It is not important to ensure that 
X" = (x")^ 1 , but we can enforce this if we wish. Let's denote the union of the 
by£. 

8.3.4. Third step: random pairing of long subwords. Notice that for any letters 
V\,V2 that there is some choice of p, u so that -v\puPv2- is reduced. It follows that 
if we fix some big constant N that every reduced word of length N will appear with 
definite frequency as a subword of y" (and likewise x", Y", X"). Moreover, for any 
V\,V2 there are at least two possibilities for p as above, and each possibility occurs 
with equal probability. 
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Let's subdivide y" and Y" into successive subwords whose lengths alternate 
between N and either 100 or 101 (the latter chosen randomly and independently 
with equal probability). The successive (tagged) subwords of length N can be 
thought of as being generated by an aperiodic ergodic Markov chain. For each word 
a of length N we can consider the collection of possible tags that could appear, 
and call this a tagged copy of a. A tagged copy of a and a tagged copy of cr _1 are 
compatible if they can be glued in such a way that the result is (locally) folded. For 
the sake of dcfinitcness, wc define an involution t on the set of tagged words so that 
for any tagged word a, the words a and i(o~) are compatible. First, the underlying 
word of i(a) should be cr _1 . Second, for each tag -vi. u V2- we distinguish two cases. 
If u = Vi we set i^vi.v^-) = -Yi-v^Yv- Otherwise, we set l{-v\. u V2-) — -V^.t/Vr. 

By construction and the discussion above, for every tagged subword ct, the ex- 
pected number of copies of a in y is equal to the expected number of copies of 
l(<j) in Y", and similarly for x" and X" . Again by the Chernoff inequality, we 
can glue compatible pairs of subwords in y" and Y" with C^n 1 / 2- ^) exceptions, 
with probability 1 — 0(C~ n ). After this gluing we are left with a tagged chain z 
of total length approximately 100n/A ', plus the collection L of (almost perfectly 
equidistributed) short loops, which has total length > C.n for some fixed constant 
C (independent of N) . If we choose N so that N 3> C then the total length of the 
short loops is much, much greater than the total length of z. 

Remark 8.10. The requirement that tags must be compatible when gluing inverse 
words forbids us from simply gluing y" to Y" and x" to X", assuming we had 
arranged for Y" = (y") _1 an d X" = (x")^ 1 . This is why we need to be a bit 
careful in defining i so that a and l(ct) are compatible. 

8.3.5. Fourth step: building Z . At the next stage we show how to partially glue up 
some (relatively small) subset of the short loops to produce a tagged copy of Z (i.e. 
the chain inverse to z) whose tags are compatible with the tags of z. We can then 
glue z to Z and be left with only short loops. Since the total mass of z is so much 
smaller than that of £, the remaining collection of short loops will still be almost 
equidistributed . 

The basic construction step is as follows. Suppose that we have two loops vpqr 
and v'RQP where v and v' are words, and p, q, r are letters. We can glue pqr to 
RQP to produce the tagged loop . r v.pv'. Note that for any two letters p and r 
there is some reduced word pqr of length 3, so the resulting tags can be anything 
(subject, of course, to the word vv' being cyclically reduced). 

We abbreviate a word of length d by the number d, so that a short loop may just 
be abbreviated by .7 (the dot preceding the 7 is the tag). A subsegment of such a 
word will be indicated by brackets, hence 4(2.1) indicates a short loop with a choice 
of specific segment of length 3 with the tag in its interior. Denote the operation of 
gluing by U and the result by — h Then, providing labels are compatible, we can 
perform an operation such as 



4(2.1) U 4(2.1) 



4.4 



See Figure [3] 



Consider the following sequence of gluings. 

4(2.1) U 4(2.1) .4.4 : 
.3(1.2)2 U 4(2.1) .3.4.2 



.3(1.2)2 
: .1(2.1)3.2 
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Figure 3. The meaning of the operation 4(2.1) U 4(2.1) ->• .4.4 



.1(2.1)3.2 U 4(2.1) -> .1.4.3.2 = .1.3(1.2)1.2 
.1.3(1. 2)1. 2 U 4(2.1) ->■ .1.3.4.1.2 = .1.1(2.1)3.1.2 

and so on. If we choose suitable short loops as the 4(2.1) summands at each stage, 
the result of m gluings will be a loop of length 7 + m representing an arbitrary 
reduced word and with arbitrary tags. Since the total length of z is so small 
compared to that of L, we can build a copy of Z from roughly \Z\ short loops, 
with tags compatible with the tags of z, and then glue Z to z. We have therefore 
reduced to an almost equidistributed collection £' of short loops. We introduce 
the following terminology. We say that a collection L' of short loops (in some Fk) 
is equidistributed with error e if for any two types £, £' the ratio of the number of 
short loops of type I to the number of type £' is between 1 — e and 1 + e. With 
this terminology, we have reduce the problem of exhibiting an /-folded surface to 
folding a collection L' of short loops in Fk which is equidistributed with error C/N. 
By choosing N sufficiently large (for fixed C) we may arrange for the error, which 
we abbreviate to ei, to be as small as we like. 

8.3.6. Fifth step: linear programming. Let denote the set of types of (tagged) 
short loops in Fk, and let denote the cone of non- negative linear combi- 

nations of elements in this set. The subspacc of finite linear combinations which 
bound a folded surface is a closed rational convex subcone of this space, contained 
in the subspace of linear combinations representing in homology. Call this subset 
the feasible cone. We claim that the projectivization of the feasible cone contains 
an open neighborhood of the uniform vector (i.e. the vector all of whose compo- 
nents are equal) in the projectivization of the subspace representing in homology. 
This will show that with overwhelming probability L' as above is in the feasible 
cone, and therefore we will conclude the proof of Theorem I8.9[ under the stronger 
hypothesis that 4> is reduced. 

For any fixed k, by Theorem 18.71 this claim reduces to finding finitely many 
vectors in R+p^] whose projective hull (i.e. cone) contains the uniform vector 
in its interior, and testing the feasibility of a specific linear programming problem 
associated to each vector. In other words, it reduces to a finite computation for 
each k. But there is a trick to reduce the computation for arbitrary k to the case 
of fc < 8. First, observe that for any short loop type £, a pair £ + l(£) bounds 
(in an obvious way), where i is the involution defined in the third step. Now, any 
short loop £ in Fk has at most 8 distinct letters, and therefore is contained in some 
copy of Fg. Let t>g denote the uniform vector in F&, and suppose that we can 
show feasibility for all homologically trivial chains in an open neighborhood of v$ 
g]. Let n : F 8 — > Fk denote the fc!/8!(fc — 8)! various inclusions of F s into Fk 
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determined by choosing 8 out of k generators, and let v' = 8!( - fc fc 7 8 ^ Y^i Ti(vg) denote 
the image. It is not quite true that v' is equal to the uniform vector Vk, but both v' 
and Vk are vectors with all coordinates positive, and which are invariant under the 
involution i. It follows that if some open neighborhood of v s is feasible, then an 
open neighborhood of v' is feasible, and an open neighborhood of v' + w is feasible 
for any (-invariant non-negative vector w; in particular, an open neighborhood of 
Vk is feasible. In the sequel we call this trick of adjusting a vector by an (.-invariant 
vector the i trick. Applying the t trick to reduce from rank k to rank 8 will magnify 
the error by a bounded amount, so we are reduced to the problem of finding a folded 
surface bounding a collection of short loops in F 8 which is equidistributed with error 
£2, where e 2 depends on ei and k. Nevertheless, for fixed k, as ei — > so too does 
62 -> 0. 

However, even with this reduction, the size of the problem is far beyond what is 
feasible on a modern computer. We now describe how to reduce the problem from 
F$ to F 2 in a series of steps, each of which magnifies the error of the equidistribution 
by a bounded amount. 

First, we reduce from Fg to F4. For each loop type £ that uses at least 5 
generators, the loop type l{£) also uses at least 5 generators. By equidistribution, 
we pair as many such loops as we can, and are left with e 2 Cn loops containing 
at least 5 generators, and (C ± e 2 )n of each type of loop containing at most 4 
generators, for some C. If £ is one of the remaining loops with at least 5 generators, 
write £ = st. u v where \s\ = 3, \t\ = \v\ = 2 and \u\ — 1. At most 4 generators 
appear in t and v, so we can find a short loop of the form V. w Txxx where -V. W T- 
is compatible with -t. u v-, and where x or X is one of the 4 generators used in t 
and v. By hypothesis we have many more copies of V. w Txxx than of st. u v, and 
after gluing -V. W T- to 4. u v- we obtain a tagged loop of the form sxxx which uses 
at most 4 generators. By the method of Step Four we may fold this sxxx up 
with a finite collection of short loops using at most 4 generators. In this way we 
eliminate all loops with at least 5 vectors, and are left with loops containing at most 
4 generators. This collection of loops may be partitioned equally into subsets, one 
for each choice of 4 out of the 8 generators. Each subset is now not equidistributed, 
but is equidistributed modulo an (-invariant vector, up to bounded error. So by 
the 1 trick we may reduce to a collection of short loops which is equidistributed in 
F4 with error e 3 . Again, 63 depends on e 2 in such a way that £3 — > as e 2 — >• 0. 

Next, we reduce from F 4 to F 3 . Again, try to cancel each £ using 4 generators 
with a copy of t(£) until we are left with only e 3 Cn such loops. Let I be a remaining 
loop using 4 generators, say a, b, c, d. Let £' be a short loop obtained from £ by 
replacing each d or D by one of a, A, 6, B. Then a copy of i(£') will cancel most of 
a copy of £, leaving a collection of loops that use only the generators a, 6, d. Cancel 
these loops using the method of Step Four. After another application of the 1 trick 
we reduce to an equidistributed collection of short loops in F 3 with error e 4 . 

Finally, we reduce from F 3 to F 2 . This step is nearly the same as the previous 
step. We try to cancel each £ using 3 generators with a copy of i(£) until we are left 
with only e^Cn such loops. Let £ be a remaining loop in the generators a, b, c, and 
let £' be a short loop obtained from £ by replacing each consecutive string of cs or 
Cs by a consecutive string of as or 6s etc., and then using a copy of i(£') to cancel 
most of a copy of £. What is left over may contain all three generators a, 6, c, but 
each component of what is left over is either of the form c*a* or c*b* and therefore 
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uses at most two generators. Each component therefore may be canceled by the 
method of Step Four. After one final application of the u trick we reduce to an 
cquidistributed collection of short loops in F2 with error 65 . This is finally within 
the reach of practical computation. 

Determining if a point lies in the interior of the cone on a set of vectors can be 
phrased as a linear programming problem by using the following lemma. 

Lemma 8.11. Let x,Vi, . . . ,Vk G R n . If the Vi span K.™ and there is an expression 
x = tiVi with ti > for all i, then x lies in the interior of the cone spanned by 
the Vi. 

Proof. To show that x lies in the interior, it suffices to show that x is not contained 
in any supporting hyperplane. Thus, let H be any supporting hyperplane for the 
cone spanned by the vi. Because the Vi span R", there is some j so that Vj is not 
in H . We have expressed x = ^ i ijUj with, in particular, tj > 0. Therefore, if we 
decompose R™ = H © spanj-Wj}, and express x in this decomposition, we will find 
that the coefficient of Vj is not zero, so x is not contained in H. □ 

Using Lemma 18.111 we observe that if we are given the Vi as the columns of the 
matrix M, and a; is a column vector, then a feasible point y for the problem Ay = x, 
y > 1 provides a certificate that x is contained in the interior. Feasibility testing 
can be phrased as a linear programming problem by setting the objective function 
to zero. We remark that the lower bound on y is arbitrary; if x is in the interior, 
the linear program will succeed for some lower bound, but we don't know a priori 
what it is. 

The fifth step of the proof therefore reduces to 

(1) Find a collection of vectors V in the feasible cone which together span the 
space of homologically trivial linear combinations of E. 

(2) Show that the uniform vector lies in the cone on V. 

Both steps require linear programming: in order to check that a vector lies in the 
feasible cone, we solve a linear programming problem to find a folded surface, and to 
show that the uniform vector lies in the cone on V, we solve the linear programming 
problem derived from Lemma 18.111 

In order to make the many linear programming problems in step 1 feasible, 
we need to choose low-density vectors; that is, vectors with a small number of 
nonzeros. Recall there are 4376 short loops of length 7 and rank 2, and the space of 
homologically trivial linear combinations has dimension 4374. We found a collection 
of 9626 vectors, each with 8 nonzeros, which span this 4374-dimensional space. This 
required solving a few tens of thousands of small linear programming problems, 
which was easily accomplished by GLPKpO]. using our program gallop[39|. As we 
built this collection, we occasionally ran a much larger linear program to determine 
if the uniform vector was contained in the cone (not necessarily in the interior, as 
that is more difficult to solve in practice). Once our cone did contain the uniform 
vector, we ran one final linear program to verify that it lay in the interior. 

Even though these latter linear programs had only a few thousand columns 
and a few thousand rows, they proved quite difficult in practice. Fortunately, the 
proprietary software package Gurobi 25J, which offers a free academic license, was 
able to solve them in a few minutes. We used Sage [37] to facilitate many of the 
final steps. 
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8.3.7. Sixth step: unreduced automorphisms. We now show how to adapt our argu- 
ment to the general case of a random endomorphism which is not assumed to be 
reduced. 

Definition 8.12. Let (f> : F — > F be an endomorphism, and let A be a real number 
satisfying < A < 1. We say </> is X-reduced if, for any two non-inverse letters a, b 
(with the possibility that a = b) the maximal suffix of <f>{a) canceling a prefix of 
4>(b) has length < \\<j)(a)\. 

Remark 8.13. Note that since we can take B, A in place of a, b we also deduce the 
length is < A|0(b)|. 

Remark 8.14. The condition of <fi being 1/2-reduced implies (in the notation of 
§ [7]) that length(0 c (a;)) > 1 for each generator i£S, and therefore that <j> has a 
well-defined action on We; compare with Lemma 17.51 

Now, suppose 4> is 1/2-rcduced. Let 8 be a cyclically reduced word, let (^(S^ 1 ) 
denote the unreduced cyclic word obtained by concatenating the words obtained 
by applying cj> to the letters of 8, and let 8' denote the word obtained by reducing 
We think of (^(S^ 1 ) as a simplicial loop, and as 8' as a smaller simplicial 
loop obtained by folding pairs of canceling subwords. There is a simplicial map 
from <5 -1 to ^{S" 1 ) taking each edge to a union of edges (i.e. to a copy of <f>{a) for 
each letter a and so on); the images of the vertices of <5 _1 are what we previously 
called /-vertices. 

The condition on (f> implies that each vertex of 8' is in the image of at most one 
/-vertex, so it makes sense (by abuse of notation) to refer to these vertices of 8' 
as /-vertices. With this definition it makes sense to define an f -folded fatgraph Y 
(over the standard rose) to be a fatgraph bounding 8 + 8' as above, and satisfying 
the conditions in Definition 18.51 (where 8 = d~, 8' = d + , and the /-folded vertices 
are those with the new definition). 

If Y is an /-folded fatgraph, then (j)(Y), the fatgraph obtained by replacing each 
edge a of Y by the corresponding word <f)(a), will not typically be folded, but by the 
1/2-reduced condition, the result of canceling maximal pairs of adjacent subwords 
in the image of adjacent letters will be a new fatgraph which is Stallings folded, 
in the usual sense. Call this new fatgraph <fi'(Y). Then we can glue <p'(Y) to Y 
along 8' , and as before we see that the result is still Stallings folded. Thus we have 
proved the following analog of Theorem 18.61 

Theorem 8.15. Let <fi be 1/2-reduced, and suppose Y is f -folded. Then the sus- 
pension of Yi(Y) is infective in F*^. 

The proof of Lemma 17.51 implies that a random endomorphism of length n is 
C. log n/n- reduced, with probability 1 — 0(C n ), for suitable constants C,C',c. 
So the method of the previous few sections applies essentially without change to 
produce an /-folded surface in this new sense. This concludes the proof of Theo- 
rem 18.91 in full generality. 

8.4. Sapir's group. 

Definition 8.16. We define Sapir's group C to be the HNN extension of F2 := (a, b) 
by the endomorphism <f> : a — > ab,b —> ba. 
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In [33) . Problem. 8.1, Sapir posed explicitly the problem of determining whether 
C contains a closed surface subgroup, and in fact conjectured (in private communi- 
cation) that the answer should be negative. This group was also studied by Crisp- 
Sageev-Sapir and (independently) Feighn, who sought to find a surface subgroup or 
show that one did not exist. Because of the attention this particular question has 
attracted, we consider it significant that our techniques are sufficiently powerful to 
give a positive answer: 

Theorem 8.17. Sapir's group C contains a closed surface subgroup of genus 31. 

Proof. The theorem is proved by exhibiting an explicit /-folded surface. Figure [4] 
indicates a fatgraph whose fattening has four boundary components, three of which 
are (conjugates of) babaBABA and the fourth of which is 4>(babaBABA~ 3 ). The 
blue circles mark the babaBABA components. By taking a 3-fold cover we obtain 
a fatgraph whose fattening has six boundary components, three of which are con- 
jugates of babaBABA 3 , and three of which are conjugates of cf>(babaBABA~ 3 ). In 
the HNN extension F*^ we can glue these boundary components in pairs, giving a 
closed surface S together with a map iti(S) — > F*^. 

The surface is /-folded, and therefore the resulting map of the surface group is 
injective. To see this, note that the babaBABA components are disjoint from each 
other, the underlying fatgraph is Stallings folded, and the /-vertices (indicated in 
red) are all 2-valent. □ 

Remark 8.18. In fact, Sapir expressed the opinion that "most" ascending HNN ex- 
tensions of free groups should not contain surface subgroups, which is contradicted 
by the Random /-folded Surface Theorem l8.9l On the other hand, the probabilistic 
estimates involved in the proof of this theorem are only relevant for endomorphisms 
taking generators to very long words, and therefore Sapir's group seems to be an 
excellent test case. 

8.5. Experimental results. The Random /-folded Surface Theorem 18.91 implies 
that surface subgroups of HNN extensions of free groups are very common, and 
implies that for sufficiently large n (depending on k) an HNN extension associated 
to a random endomorphism of length n is virtually certain to contain one. However, 
the precise meaning of "sufficiently large" implicit in the argument is far beyond 
the reach of computer experiment. On the other hand, the construction of an /- 
folded surface subgroup of Sapir's group suggests that such subgroups are even 
more plentiful than our argument certifies, and may be found even for n of modest 
size. In this section we describe experimental results for small n which complement 
the asymptotic conclusions of Theorem 18. 91 

In order to naively use the algorithm in the Folded Feasibility Theorem 18.71 
to search for /-folded surfaces it is necessary to represent a given automorphism 
<j) : F — > F by a graph immersion / : X — > X. For ease of computational efficiency, 
it is very convenient to consider maps for which X can be taken to be the standard 
rose; however, we weaken the condition that / : X — > X is a graph immersion to 
the condition that c (cc) is nonempty for every generator x, where (j) c denotes the 
core of <j> (see Proposition 17. II for a discussion of cores). This means precisely that 
whenever xyz is a reduced word of length 3, there are letters from <p(y) remaining in 
the reduced word obtained from the product (f>(x)(f>(y)(/)(z). Note that this implies 
that (f> is well-defined on We for all I > 2. 
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FIGURE 4. A fatgraph bounding 3 • babaBABA + ^(babaBABA^ 3 ) 



Now fix F = (a, b). Let E denote the set of all endomorphisms of F of length 
3 (i.e. taking each generator to a reduced word of length 3) and satisfying the 
following properties: 

(1) for every generator x we have |</> c (x)| > 1; and 

(2) no power of 4> sends a generator to a conjugate of a power of itself. 

Note that the first condition ensures that \4>(w)\ > \w\ for every conjugacy class w, 
so (by the second condition) <p can't be an automorphism. The second condition 
is implied by irreducibility. The cardinality of E is 752. For each </> 6 E, we 
determined whether there is an /-folded surface bounding C — <fi n (C), for 1 < m < 4 
and for C one of the chains a + b + AB or abAB. Continuing the search to m = 5 is 
not computationally feasible. However, it is possible to perform a randomized search 
for /-folded surfaces in this case: for each endomorphism, we computed C — <fi 5 (C) 
and did many random foldings; that is, we simply glued C — (f> 5 (C) to itself along 
many compatible segments. The resulting (tagged) chain is short enough that an 
exhaustive search for /-folded surfaces can proceed. Of course this method may 
fail to find an /-folded surface bounding C — <j) 5 {C) even if one exists, but it is a 
reasonable way to continue. For the randomized search, we tried many different 
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m 


surface found 


fraction (out of 752) 


1 








2 


52 


0.069 


3 


492 


0.654 


4 


636 


0.846 


5* 


676 


0.899 



Table 1 . Number of <j> £ E for which there is an /-folded surface 
bounding C - (f> m (C) with C = abAB or a + b + AB. For m = 5 
the search was randomized. 



chains C. See Table Q] for the summary of results. Here <fi has "no surface found" 
for some power n if we could not find an /-folded surface bounding C — <j> 1 (C) for 
i < n for any C which we tried. So the first line of Table [T] means that we could 
not find any /-folded surfaces using the first power of any endomorphism. 

In a second experiment, we vary the length n of the endomorphism (i.e. the 
length of the words <fc(x) as x ranges over the generators), and compute the proba- 
bility of "easily" finding an /-folded surface. Explicitly, for each value 5 < n < 24 
we generated approximately 5000 random endomorphisms of length n. For each 
such </>, we searched for an /-folded surface bounding C — <j>(C) for C = [a, b}. Note 
this is a very naive search; as our previous experiment shows, taking powers and 
a variety of chains C produces far more /-folded surfaces. Nevertheless, we still 
observe that the probability of finding an /-folded increases quickly. 



■0.997 



-I — t t t t — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I 

5 n 24 

Figure 5. The experimental fraction of endomorphisms 4> of 
length n for which [a, b] — 4>([a, b)) bounds an /-folded surface. 



On the basis of this experimental evidence, together with the theoretical results 
from this section and the next, we feel therefore reasonably confident in making the 
following conjecture: 

Conjecture 8.19. Let : F — >• F be an irreducible endomorphism which is not 
an automorphism. Then for some power (f> m of <j) and some chain C there is an 
f -folded surface bounding C — 4> m (C) . In particular, the HNN extension F*^ should 
contain many closed surface subgroups. 



9. Extremal surfaces and flat surfaces 

In this section we are concerned with finding surface subgroups of HNN exten- 
sions by methods more closely related to the theory of stable commutator length, 
and the techniques developed in §EHH We distinguish between four classes of injec- 
tive surface subgroups in an HNN extension, related to each other by the following 
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diagram of inclusions 

flat > /-folded 

I 1 

extremal > injective 

Here we can take /-folded either in the narrow sense of Definition 18.51 for an im- 
mersion /, or in the broader sense of /-folded for / merely 1/2-reduced as in 
Definition Hm 

The properties of /-foldedness and of extremality are both certified by linear 
programming. An /-folded surface carried by a fatgraph Y over X is flat if there is 
an /-invariant fattening of X which pulls back under Y — > X to the given fatgraph 
structure on Y. 

Extremal surfaces and flat surfaces are (by definition) homologically essential; 
thus no HNN extension can contain such a surface unless 4> fixes some vector in 
Hi(F). We reduce the question of such surfaces, therefore to two intermediate 
questions. Firstly, when does an HNN extension F*^ have virtually positive &2? 
And secondly, when is some class in H2(F* ( f > ) (assuming one exists) represented by 
an extremal surface? 

We cannot give firm theoretical answers to these questions; however, the tools 
that we have developed give us the ability to test these questions experimentally, 
and to quickly certify the existence of extremal or flat surfaces. One interesting 
aspect of this story is the existence of certain homomorphisms to M defined on 
subsemigroups of End a (F), the stabilizer of a homology class a £ Hi{F). Such 
homomorphisms are associated to (counting) quasimorphisms in Qi . In the special 
case of rotation quasimorphisms, the associated homomorphisms are seen to be 
closely related to a particular summand of the Johnson homomorphism, defined 
on the Torelli subgroup of the mapping class group; see Proposition 19.71 and the 
related discussion in § 19.31 

9.1. Nontrivial homology in finite index subgroups. In this subsection we 
consider the question: given an endomorphism <f> of F, when is there a finite index 
subgroup F 1 of F and a power <f>' of <f> taking F 1 into F 1 such that <\>' fixes some 
nontrivial vector in Hi(F')? 

If we fix F free of rank k, the probability that a random endomorphism of length 
n fixes a particular basis vector in Hi is of order n~ k l 2 for fixed k and n — > oo, and 
of order k~ n l 2 for fixed n and k — > oo, in each case subject to the parity condition 
that n should be odd (the precise asymptotics are obtained by Sharp; see [35]). If 
F' is a subgroup of F of index m, then the rank of F' is mk — m + 1; thinking 
of the length of a random endomorphism as (approximately) the stretch factor, we 
can think of a lift of <f> to F' (if one exists) as an endomorphism of length of order 
n. Of course, the lift is no longer a random endomorphism, and different lifts to 
different F' are certainly not independent. However this heuristic suggests that 
it should be relatively easy to find nontrivial fixed elements in Hi in finite index 
subgroups of F. In fact, the probability of finding such fixed vectors should go to 
1 exponentially fast in the index. At least for small values of n, this heuristic is 
borne out in practice. 

Fix F = (a, b), and let E denote the set of endomorphisms 4> of length 3 for 
which no power of <f> sends a generator to a conjugate of a power of itself. Then 
\E\ = 848. For each tf> G E, we searched for finite index subgroups F' of F with 
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4>{F') C F' and such that cf> fixes some nontrivial vector in Hi(F'). We find that in 
every case there is such a finite index subgroup with index at most 10. For length 
4 endomorphisms, of which 11, 132 satisfy our condition, the number without such 
a finite index subgroup through index 11 is 328, or 2.9%, at which point continuing 
the computation becomes infeasible. 



1 T 



—1.0 1 T 



0.971 



H 1 1 1- 







H 1 1 1- 



i 10 i 11 

Figure 6. The fraction of length 3, left, and length 4, right, irre- 
ducible endomorphisms <f> for which there is a subgroup of F^ with 
finite index at most i and with nontrivial Hi . The plot on the left 
reaches 1 at i = 10; the plot on the right docs not quite. 



9.2. Gromov norm. Let G be a semigroup acting linearly on a vector space C, 
and preserving a subspace B. There is a quotient action of G on H := C/B. If 
a G H and G a is the stabilizer of a, then there is an affine action of G a on a + B. 
This action has a global fixed point only if there is some a G C in the class of a 
fixed by G a . 

The sequence B — > C —> H of G^-modules gives rise to a long exact sequence in 
cohomology: 

H°(G a ; B) -> H°{G a ; C) -> iJ°(G Q ; H) -> H 1 (G a ;'B) -> H 1 ^^ C) ->• • • • 

For any G a -module V, the group -ff°(G a ;F) is equal to V Ga , the subspace of V 
fixed pointwise by G a . There is a tautological class a G iJ (G a ;H), whose image 
in /^(G^B) is nontrivial unless it comes from some invariant a G H°(G a ;C). 
Cocycles c a representing the class in H 1 (G a ;'B) are in bijection with elements 
a G C representing a, by the formula c a (g) — ga — a. Note that this class becomes 
trivial in H l {G a ;C). 

Now let's specialize to the case that G = End(i^) for F a free group. We can take 
C, B, H to be G/e, B/e, Hi as in § [5] If we choose some nontrivial a G Hi(F) and 
let End a denote the subsemigroup fixing the class a, then for each 4> G End Q there 
is a natural isomorphism between ^(-F 1 *^) and Hi(F)^, the subspace of Hi(F) 
fixed by </>. 

Remark 9.1. Since is represented (with respect to some basis of Hi(F;Z)) by 
an integer matrix, the subspace of Hi(F;M) fixed by cj>* is defined over Q. 

If there is a (pseudo)-norm || • || on B invariant under G, then the action of G a 
on a + B is an affine isometric action. Now, the action of End Q (F) on B/e with 
the scl norm is 1-Lipschitz in general, but in fact for random endomorphisms <fi the 
action is actually isometric; see [T3], Thm. 3.16. 

Remark 9.2. The Random Isometry Theorem 3.16 from |13) actually proves that 
the probability that a random endomorphism of length n is an isometry goes to 1 
like 1 — G~™. For a given integral homology class a G Hi(F) the probability that 
<f> G End(F) preserves a is of order vT k l' 1 where k is the rank of F. Thus, even 
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conditioned on <f> lying in End Q (F), the probability that <fi is isometric goes to 1 
exponentially fast in n. 

The translation length t(<P) of G F,nd a (F) on B/e is equal to the infimum of 
scl(c — 4>{c)) over all representatives c of the class a. By properties of translation 
length, we may also express this as a limit 



for any c € B/e representing a. 

We now express this translation length in the language of so-called Gromov 



Definition 9.3. Let X be a topological space, and let A G H*(X\ M). The Gromov 
norm of A, denoted ||A||, is the infimum 



where the infimum is taken over all (singular) cycles ti°~i representing the ho- 
mology class A. 

Now, H*(X;R) = H*(X;Z) ® K. In dimension 2, H 2 (X;Z) is generated by 
maps from closed, oriented surfaces S into X, and H2(tti(X);Z) is obtained by 
quotienting out the image of Tt2{X) under the Hurewicz map. 

Lemma 9.4 (See e.g. [8], p. 11). For A G H2(tti(X);Z) there is an equality 



where the infimum is taken over all positive integers n, all closed oriented sur- 
faces S without sphere components, and all maps f : S —> X representing nA in 



With this definition we have the following lemma: 

Lemma 9.5 (Gromov norm is translation length). Let A G H2(F* ! j ) ) be the class 
corresponding to a € H\{F)^ . There is an equality \\A\\ — 4r(0) where || • || denotes 
the Gromov norm. 

Proof. Let / : S — > K(F* t p, 1) represent nA for some n. We think of K (F, 1) as a 
horizontal slice of K(F*$, 1) as in ij l8.ll and put f(S)C\K(F, 1) in general position. 
Then f(S) H K(F, 1) is a 1-cycle c representing a and we obtain an inequality in 
one direction. The inequality in the other direction is obtained by finding a surface 
/ : S — >• K (F, 1) with dS representing some multiple of c — 0(c), and gluing this up 
in the suspension to a closed surface representing a multiple of A. □ 

A surface / : S — s> X representing nA is extremalii —2x(S)/n — \\A\\\. Extremal 
surfaces do not necessarily exist, but if they do they are injective: 

Lemma 9.6. Extremal surfaces are tti -injective. 

Proof. If / : S — > X is not injective there is a finite index cover S' of S with a 
simple loop in the kernel (this existence of such a cover follows from LERF for 
surface groups; see [34]). Then /' : S' — >• X can be compressed along this simple 
loop, producing /" : S" — > X with strictly lower norm. Contrapositively, if S is 
extremal, no such virtual compression is possible. □ 



t(4>) = lim -sc\{c-(j) n {c)) 



n— >oo n 



norms. 




ff 2 (7n(X);Z). 
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9.3. Extremal surfaces and counting quasimorphisms. Now let Te denote 
the verbal traintrack of length t. Recall from Definition 13.31 the linear operators 
d : R[E] -> R[V] and H : R[E] -4 Hi, and the subspace We = kerd © H. Define 
Wi = ker d and observe that there is a short exact sequence 

We-tWe-t Hi 

By Lemma 17.51 a random endomorphism of length n has a well-defined action on 
Wi, with probability f — 0(C~ n ). The same argument in fact gives an action on 
We- Again, conditioning on fixing some vector a G Hi, a random endomorphism 
in End Q (F) has a well-defined linear action on Wi and we obtain therefore a well- 
defined affine linear action on We- Since B/e — > Wi is 1-Lipschitz for scl, it follows 
that the translation length Te(<t>) of cj> on We (for the affine action) is a lower bound 
on t{4>). 

Recall Theorem 17.61 which says that for and I there is a fixed m so that for 
<f> G End(.F) a random endomorphism of length n, with probability f — 0(C~ n ) 
the map cf> : <fr m (Wg) — > 4> rn (Wt) is an isometry. In fact, the subspace on which it 
acts isometrically is dual to the family of tripod hyperplanes permuted by <j>. Now, 
conditioning on (f> fixing a, the conclusion of the theorem still holds with probability 
f — 0(C~ n ) (at the cost of adjusting the constant C). Because the unit ball in the 
norm on the fixed subspace is polyhedral, there is an m depending only on I so that 
for random <f> 6 End(-F), the map (f> m : We —> We is projection to one of finitely 
many possible fixed subspaces, and therefore random elements of End Q (_F) can be 
partitioned (nonuniquely) into finitely many subsemigroups End^i, each of which 
maps to an affine group of the form X G where G is finite. There is thus a 
subsemigroup of finite index in End Q ,i(-F) whose image in G is finite, and which 
maps homomorphically to the translation group M. N . As we explain in the next 
subsection, this homomorphism generalizes one of the summands of the Johnson 
homomorphism. 

In any case, for each random <f> E F<nd a (F) the power (j> m acts affinely on We 
by the combination of projection to some subspace, followed by translation of that 
subspace, realizing the translation length on some further subspace. Dual to this 
subspace is a fixed subspace of Qe/H 1 , and the suspension of any class in this 
subspace defines an element of bounded cohomology H%(F*$) positive on A. 

9.4. Euler class and rotation quasimorphism. Throughout this section we fix 
a free group F, and an element a € Hi(F). Let F — > 7Ti(S) be a realization of F 
as the fundamental group of a compact, oriented surface S, which we suppose to 
have a hyperbolic structure with geodesic boundary. 

Let End Qj s(-F) denote the semigroup of outer endomorphisms of F fixing a, 
and geometric with respect to the realization S. Thus elements of End Q , s(i ;l ) 
are represented by immersions with geodesic boundary of S into itself, fixing the 
homology class a. 

Proposition 9.7. Let a G Hi(F;Z) be primitive. There is a homomorphism 
p a ,T. ■ End Qj s(-F 1 ) 2 defined as follows. If c is any cycle representing a, and rots 
is the rotation quasimorphism associated to the realization of F as 7Ti(E), then 

Pa,s{4>) = rot s (c - 0(c)) 

Proof. Since elements of 7Ti(£) fix points in S^, it follows that rots takes values 
in Z (on integral chains). 
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By the linearity of rots , to show that p is a homomorphism it suffices to show 
that it is well-dcfiried; i.e. that rots(c — </>(c)) does not depend on the choice of a 
representative c for the homology class a. If we choose a different representative 
d then we need to show rots(c — c') = rots(0(c — c')). But rots ° 4> = </>*rots = 
rots , since rots depends only on the cyclic order on the ends of F induced by the 
realization, and any geodesic immersion preserves the cyclic order. □ 

Note that if we fix <fi and vary a, the function p.^((j>) defines a homomorphism 
from ffi(S)^ to Z. Identifying i?i(E)^ with ^(F*^), we obtain in this way an 
element of H 2 (F* (t> ). 

Definition 9.8. The cohomology class in H 2 {F*^) whose value on A £ ^(F*^) 
associated to a G Hi(F)^ is p Q! s(0) is called the Euler class associated to the 
invariant realization E. 

An automorphism of F which is geometric with respect to the realization E is 
nothing but an element of the mapping class group of E. If we denote by Mod a (E) 
the subgroup of the mapping class group fixing a, we thereby obtain a homomor- 
phism p a to Z. The Torelli subgroup 3(E) is defined to be the subgroup of Mod(E) 
fixing all of iZi(E). The homomorphisms p a over all a can be put together into a 
single homomorphism p : 3(E) — > if^E), which is one summand of the well-known 
Johnson homomorphism. In fact, the relationship of this summand to rotation 
quasimorphisms (in the language of Chillingworth [15] as discussed in Remark 15 .8[) 
can already by found in Johnson's original paper |26j . 

If <9E is empty or has one component, p : 3(E) — » _ff x (E) is surjective. In general, 
the image is equal to the image of ifi(E) under contraction (i.e. cap product) 
with the intersection form u> 6 A 2 i/ 1 (E). In geometric terms, p a is nontrivial on 
Mod Q (E) if and only if there is /3 with uj(a, /3) ^ 0. However, p a is always nontrivial 
on End Q ,s(F). 

Example 9.9. Suppose there is a simple, separating geodesic 7 representing a (for 
example, 7 could be a boundary component of S if dS has at least 2 components). 
Let 7' be an immersed geodesic homologous to 7 with rots (7 — 7') = 1- To see 
that there is such a 7', first take any curve homologous to 7, and insert a suitable 
power of [a, b] into a cyclic word representing the conjugacy class, where a, b are 
any elements of 7Ti(E) generating a subgroup equal to the fundamental group of 
some immersed once-punctured torus. 

Now, there is a cover E' of E containing an embedded lift of 7', and since 7' is 
immersed, it separates E' into two nontrivial pieces. There is a further finite cover 
E" of E' to which 7' still lifts, where the pieces on either side have arbitrarily large 
genus. Then the components of E — 7 embed into the components of E" — 7', and 
composition with the covering projection E" — > E determines an immersion of E 
to itself taking 7 to 7'. 

In a similar way one can show that p a ,n always surjects onto Z. 
9.5. Flat surfaces. 

Definition 9.10. Let <fi £ End Qi s(-F). A flat surface is an immersion S — > E with 
geodesic boundary, so that OS = 7 — 0(7) for some (immersed) geodesic 1-manifold 
7 representing a. 
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A flat surface glues up in the suspension to produce a closed surface mapping 
to K(F* c f > , 1). By abuse of notation, we also refer to this closed surface as a flat 
surface. 

Lemma 9.11. A flat surface is extremal, and therefore ni-injective. 

Proof. To see that a flat surface is extremal, it suffices (by Lemma I9.5j) to see that 
scl(7 — 0(7)) is the infimum of scl(c — 4>{c)) over all c representing a. But this is 
certified by Bavard duality and the existence of the extremal dual quasimorphism 
rot s . □ 

The next lemma shows how to find flat surfaces in practice. The point of this 
lemma is that both sides of the formula can be computed efficiently in practice. 

Lemma 9.12. There is a flat surface S with dS — 7 — ^(7) if and only if scl(7 — 
0(7)) = irot E (7-<K7))- 

Proof. This is immediate from the definitions. □ 

A flat surface has Gromov norm zero if and only if it is a torus; this corresponds 
precisely to the case that <fi preserves a (homologically essential) conjugacy class in 
F. Otherwise the Gromov norm of such a surface will be positive, and therefore 
Pa,Y.{4>) must be nonzero. Conversely, providing <f> does not preserve a nontrivial 
conjugacy class (for example, if <fi is irreducible), the vanishing of p a ,T.{4>) is an 
obstruction to the existence of a flat surface representing the given homology class. 

The definition of flat surfaces makes sense even when <f> is an automorphism of 
E. In this case the suspension := E x [0, l]/(x, 0) ~ (</>(x),l) is a 3-manifold 
fibering over the circle. The Euler class of the fibration (i.e. the Euler class of 
the plane field of tangent planes to the surface fibers) agrees with the Euler class 
as given in Definition 19.81 explaining the name. 

Lemma 9.13. Let (f> G Mod(E), and let M^, be the 3-manifold obtained by suspen- 
sion. A homology class A 6 H2(M$) is represented by a flat surface if and only if 
the homology class A projectively intersects the boundary of the fibered face of the 
Thurston norm ball in Hz^M^jdM,/,) associated to e<$>. 

Proof. Integral classes in the cone on the fibered face correspond precisely to 
Thurston norm-minimizing surfaces S for which is extremal. These are ex- 
actly the surfaces which can be isotoped to be transverse to the suspension flow. 
Such a transverse surface is flat. Conversely, flat surfaces are norm-minimizing, and 
certified by e^. □ 

Agol 1 showed that for any 3-manifold M whose fundamental group satisfies a 
technical condition called residually finite rational soluble (usually abbreviated to 
RFRS), for any nontrivial A £ H^iM, dM) there is a finite cover of M for which the 
transfer of A is projectively in the closure of a fibered face. The RFRS condition 
is now known (by work of Wise [42]) to hold for all 3-manifold groups with if 2 
positive; it follows that every norm-minimizing surface is virtually flat. 

Example 9.14. Let S be a genus 2 surface with one boundary component. Let T be 
an embedded once-punctured torus, and let 7 be a nonseparating essential simple 
closed curve in S — T . Form 7' by connect summing 7 to dT along an embedded 
arc disjoint from either. Then 7 — 7' bounds an embedded twice-punctured torus 
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T". Since 7 and 7' are both nonseparating, there are many <j) G Mod(S') taking 7 
to 7'. For any such <f>, the surface T' glues up to a flat surface in the suspension. 

Example 9.15. Let F = (x,y) and <f>{x) — xYxY, <p(y) = yyXYx. Then 0([y]) = 
[y]. Let E be the basic surface realization given by the cyclic order xyXY . One 
can check that <f> is geometric with respect to E. Another way to show this is to 
simply check that rots(<^([y, x])) = 1, since [y, x] is the boundary of this realization, 
and this shows the image surface is immersed. Finally, we check that rots(y — 
yyXYx) = 1 and sc\(y — yyXYx) = 1/2. By Lemma f9.12[ any extremal surface 
for y — yyXYx is flat. Using scallop [14] (made easier by wallop (40]), we can 
find a particular example: a genus 1 surface with boundary yy + XyxYY XyxYY . 
A set of generators for this surface group is {yy, Xyyx, XXyx,t}, where recall 
F* = (x, y, t 1 txt' 1 = <f)(x), tyt' 1 = <f>(y)) 

Example 9.16. Let F — (x, y, z), and let <fi : F — > F be defined 

x i-» yZAFz, y i-> ZZAz.x, z h-> zYXzX. 

Then </> is geometric with respect to the realization E given by the cyclic order 
acCbAB, as described above, and 4>([x] + [ZZ]) = [x] + [ZZ]. We also have scl(a; + 
ZZ + Zyxzy + xZxyZxyX) = 3/2, and rots(a; + ZZ + Zyxzy + xZxyZxyX) = 3. 
Therefore, any extremal surface for this chain gives a norm-realization surface. 
For example, there is a genus zero surface (punctured sphere) with 8 boundaries 
which bounds twice the chain. Gluing this surface produces a closed genus 4 norm- 
realizing surface in F*<j,. 

Example 9.17 (Flat branched surface). Suppose <f> : F — > F is geometric with 
respect to some realization, so that it is realized as an immersion / : S — > S. If 
we fix a hyperbolic structure on S, we can insist that / takes dS to a collection of 
geodesies in S. The immersed surface f(S) lifts to an embedding in a finite cover 
S — > S, and the difference 5' := S — f{S) is an essential surface, immersing in S, 
and bounding ndS — f(dS) for some positive integer n. Some finite cover of the 
surface S' glues up in the HNN extension to produce an oriented branched surface 
S" with the property that every oriented surface carried by S" is flat (note that 
such oriented surfaces necessarily have boundary). Now, %i(S") contains closed 
surface subgroups, but these surfaces are not oriented compatibly with S". Is it 
possible that some of these surface subgroups map injectively to F*^? 

We end our discussion of extremal and flat surfaces with two natural questions. 

Question 9.18. Let F*^ be an HNN extension associated to an endomorphism <f>. 
Is the unit ball in the Gromov norm on ^2(^*0) a finite-sided rational polyhedron? 
Is every rational class projectively represented by an extremal surface? Is every 
extremal surface dual to an extremal quasimorphism obtained from an invariant 
class in some Qi ? 

Question 9.19. Let F*^ be an HNN extension associated to an endomorphism <f>. 
Is every homology class in H2(F* c / ) ) virtually dual to the Euler class of some virtual 
invariant realization? 

9.6. Experimental results. For concreteness, consider endomorphisms of length 
at most 5 in a rank 2 free group. There are 25, 544 endomorphisms with non- 
elementary image, satisfying |^ c (^)| > 1 for each generator x, and fixing some 
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primitive vector in homology. By Theorem 17.241 these are all virtually geometric. 
For computational reasons, we focus on the 6, 136 for which 2 is geometric. Of 
these, we find 940 which contain certified flat surface subgroups. 
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